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ABSTRACT 

The combination of 7-year data from WMAP and improved astrophysical data rigorously tests 
the standard cosmological model and places new constraints on its basic parameters and extensions. 
By combining the WMAP data with the latest distance measurements from the Baryon Acoustic 
Oscillations (B AO) in the distrib ution of galaxies (Percival ct al. 2009) and the Hubble constant (Hq) 
measurement (Ries s et alj 120091) . we determine the parameters of the simplest 6-parameter ACDM 
model. The power-law index of the primordial power spectrum is Ug = 0.968 ± 0.012 (68% CL) 
for this data combination, a measurement that excludes the Harrison-Zel'dovich-Peebles spectrum by 
99.5% CL. The other parameters, including those beyond the minimal set, are also consistent with, and 
improved from, the 5-year results. We find no convincing deviations from the minimal model. The 7- 
year temperature power spectrum gives a better determination of the third acoustic peak, which results 
in a better determination of the redshift of the matter-radiation equality epoch. Notable examples of 
improved parameters are the total mass of neutrinos, 'Y^rrii, < 0.58 eV (95% CL), and the effective 
number of neutrino species, N^s = 4.34j^g'gg (68% CL), which benefit from better determinations 
of the third peak and Hq. The limit on a constant dark energy equation of state parameter from 
WMAP-l-BAO-l-iJo, without high-redshift Type la supernovae, is w; = -1.10 ± 0.14 (68% CL). We 
detect the effect of primordial helium on the temperature power spectrum and provide a new test of 
big bang nucleosynthesis by measuring Yp = 0.326 ± 0.075 (68% CL). We detect, and show on the 
map for the first time, the tangential and radial polarization patterns around hot and cold spots of 
temperature fiuctuations, an important test of physical processes at z = 1090 and the dominance 
of adiabatic scalar fluctuations. The 7-year polarization data have significantly improved: we now 
detect the temperature- £^-mode polarization cross power spectrum at 21ct, compared to IBcr from 
the 5-year data. With the 7- year temperature- i?-mode cross power spectrum, the limit on a rotation 
of the polarization plane due to potential parity-violating effects has improved by 38% to Aa = 
— 1.1° ±1.4° (statistical) ±1.5° (systematic) (68% CL). We report significant detections of the Sunyaev- 
Zel'dovich (SZ) effect at the locations of known clusters of galaxies. The measured SZ signal agrees 
well with the expected signal from the X-ray data on a cluster-by-cluster basis. However, it is a factor 
of 0.5 to 0.7 times the predictions from "universal profile" of Arnaud et al., analytical models, and 
hydrodynamical simulations. We find, for the first time in the SZ effect, a significant difference between 
the cooling- flow and non-cooling-fiow clusters (or relaxed and non- relaxed clusters) , which can explain 
some of the discrepancy. This lower amplitude is consistent with the lower-than-theoretically-expected 
SZ power spectrum recently measured by the South Pole Telescope collaboration. 
Subject headings: cosmic microwave background, cosmology: observations, early universe, dark matter, 
space vehicles, space vehicles: instruments, instrumentation: detectors, telescopes 



^ WMAP is the result of a partnership between Princeton Uni- 
versity and NASA's Goddard Space FUght Center. Scientific 
guidance is provided by the WMAP Science Team. 

^ Texas Cosmology Center and Department of Astronomy, 
University of Texas, Austin, 2511 Speedway, RLM 15.306, 
Austin, TX 78712; komatsu@astro.as.utexas.edu 

^ Dept. of Astrophysical Sciences, Peyton Hall, Princeton Uni- 
versity, Princeton, NJ 08544-1001 

* Astrophysics, University of Oxford, Keble Road, Oxford, 
OXl 3RH, UK 

^ Dept. of Physics & Astronomy, The Johns Hopkins Univer- 
sity, 3400 N. Charles St., Baltimore, MD 21218-2686 

« Code 665, NASA/Goddard Space Flight Center, Greenbelt, 
MD 20771 

''' Dept. of Physics, Jadwin Hall, Princeton University, Prince- 
ton, NJ 08544-0708 

* Canadian Institute for Theoretical Astrophysics, 60 St. 
George St, University of Toronto, Toronto, ON Canada M5S 
3H8 

® Princeton Center for Theoretical Physics, Princeton Univer- 



sity, Princeton, NJ 08544 

" Dept. of Physics and Astronomy, University of British 
Columbia, Vancouver, BC Canada V6T IZl 

11 ADNET Systems, Inc., 7515 Mission Dr., Suite AlOO Lan- 
ham, Maryland 20706 

12 Columbia Astrophysics Laboratory, 550 W. 120th St., Mail 
Code 5247, New York, NY 10027-6902 

1^ Depts. of Astrophysics and Physics, KICP and EFI, Uni- 
versity of Chicago, Chicago, IL 60637 

1** Dept. of Physics, Brown University, 182 Hope St., Provi- 
dence, RI 02912-1843 

15 UCLA Physics & Astronomy, PO Box 951547, Los Angeles, 
CA 90095-1547 



2 



Komatsu et al. 



1. INTRODUCTION 



A simple cosmological model, a flat universe with 
nearly scale-invariant adiabatic Gaussian fluctuations, 
has proven to be a remarkably good fit to ever im- 
proving cosmic microwave background (CMB) data 
(iHinshaw et al.ll2009HR"eichardt et al.ll2009HBrown et al l 



2009f) ■ large-scale s tructure data ( Reid et alJ 2010a : 



Percival et al 
2009hl: 



1 r 



200^, 



supernova data (iHicken et alJ 
2009il ■ cluster measurements 



.Kessler et al. 

()Vikhlinin erall l2009bl: iMantz et alJ l2010cD . distance 



measurements (iRiess et all I2009D. measurements of 
strong (iSuvu et alj 120101: iFadely et^ l. 2009) an d weak 
(iMassev et alJ 120071 : iFu et all 12001 iSchrabback et alJ 
gravitational lensing effects. 
Observations of CMB have been playing an essen- 
tial role in testing the model and const raining its ba- 
sic pa rameters. The WMAP satellite (jBennett et al.l 
I2003allbh has been measuring temperature and polariza- 
tion anisotropies of the CMB over the full sky since 2001. 
With 7 years of integration, the errors in the tempera- 
ture spectrum at each multipole are dominated by cos- 
mic variance (rather than by noise) up to / ~ 550, and 
the sign al-to-noise at each m ultipole exceeds unity up to 
/ « 900 (jLarson et al.ll2010l) . The power spectrum of pri- 
mary CMB on smaller angular scales has been measured 
by other experiments u p to ? 3000 (^ eichardt ct al. | 
20091: IBrown et all2009l:lLueker et al.ll20icii: iFowler etall 



2010 ) 



The polarization data show the most dramatic im- 
provements over our earlier WMAP results: the 
temperature-polarization cross power spectra measured 
by WMAP at Z > 10 are still dominated by noise, and 
the errors in the 7-year cross power spectra have im- 
proved by nearly 40% compared to the 5-year cross power 
spectra. While the error in the p ower spectrum of the 
cosinological ii^-mode polarizatio n (jSeliak fc Zaldarriagal 
119971: iKamionkowski et al.1ll997[ ) averaged over 1 = 2- 
7 is cosmic-variance limited, individual multipoles are 
not yet cosmic-variance limited. Moreover, the cos- 
mological i? -mode pol arization has not been detected 
(iNolta et all [20 09: Ko matsu et al.l l2009al: IBrown et all 
120091: IChianeet al.ii2010ir 

The temperature-polarization (TE and TB) power 
spectra offer unique tests of the standard model. The TE 
spectrum can be predicted given the cosmological con- 
straints from the temperature power spectrum, and the 
TB spectrum is predicted to vanish in a parity-conserving 
universe. They also provide a clear physical picture of 
how the CMB polarization is created from quadrupole 
temperature anisotropy. We show the success of the stan- 
dard model in an even more striking way by measuring 
this correlation in map space, rather than in harmonic 
space. 

The constraints on the basic 6 parameters of a flat 
ACDM model (see Table |T]), as well as those on the 
parameters beyond the minimal set (see Tabled, con- 
tinue to improve with the 7-year WMAP temperature 
and polarization data, combined with improved exter- 
nal astrophysical data sets. In this paper, we shall 
give an update on the cosmological parameters, as de- 
termined from the latest cosmological data set. Our 
best estimates of the cosmological parameters are pre- 
sented in the last columns of Table [1] and [2] under the 



name " WMAP-|-BAO-|-i/o-" While this is the minimal 
combination of robust data sets such that adding other 
data sets does not significantly improve most parame- 
ters, the other data combinations provide better limits 
than WMAP+BAO+Hq in some cases. For example, 
adding the small-scale CMB data improves the limit on 
the primordial helium abundance, Yp (see Table |3| and 
Section 14.81) , the supernova data are needed to improve 
limits on properties of dark energy (see Table |4] and Sec- 
tionjS]), and the power spectrum of Luminous Red Galax- 
ies (LRGs; see Section I3.2.3|) improves limits on proper- 
ties of neutrinos (see footnotes g, h, and i in Table ID and 
Sections [Ml and ■ 

The CMB can also be used to probe the abundance as 
well as the physics of clu sters of galaxies , via the SZ ef- 
fect (IZel'dovich fc Sunva cv 1969; Sunva evfc Zel'dovichl 
I1972D . In this paper, we shaU present the WMAP 
measurement of the averaged profile of SZ effect mea- 
sured towards the directions of known clusters of galax- 
ies, and discuss implications of the WMAP measure- 
ment for the very small-scale {I > 3000) power spectrum 
recently m easured by the South Pole Telescope (SPT; 
i Lueker^al. 2010) and A tacama Cosmology Telescope 
fACT: IFowler et al]|2010l ) cohaborations. 

This paper is one of six papers on the analysis of the 
WMAP 7-year data: iJarosik et all (|2010D report on the 
dat a processing , map- making, and systematic error lim- 
its; IGold et al.l (|2010D on the modeling, understanding, 
and subtraction o f the tempera ture and polarized fore- 
ground emission; ILarson et al.l (j2010 ) on the measure- 
ments of the temperature and polarization power spectra, 
extensive testing of the parameter estimation method- 
ology by Monte Carlo simulations, and the cosmolog- 
ical parameters inferred from the WMAP data alone; 
iBennett et al. (2010) on the assessments of statistical sig- 
nificance of various "anomalies" in the WMAP temper- 
ature map reported in the literature; and lWeiland et ahl 
(|2010D on WMAP^s measurements of the brightnesses of 
planets and various celestial calibrators. 

This paper is organized as follows. In Section |21 we 
present results from the new method of analyzing the 
polarization patterns around temperature hot and cold 
spots. In Section 121 we briefiy summarize new aspects of 
our analysis of the WMAP 7-year temperature and po- 
larization data, as well as improvements from the 5-year 
data. In Sectional we present updates on various cosmo- 
logical parameters, except for dark energy. We explore 
the nature of dark energy in Section |SJ In Section [SI we 
present limits on primordial non-Gaussianity parameters 
/nl- In Section [71 we report detection, characterization, 
and interpretation of the SZ effect toward locations of 
known clusters of galaxies. We conclude in Section jS) 

2. CMB POLARIZATION ON THE MAP 

2.1. Motivation 

Electron-photon scattering converts quadrupole tem- 
perature anisotropy in the CMB at the decoupling 
epoch, z = 1090, into linear polarization (Rces 1963 
Basko & Polnarev 1980; Kaiser 1983; Bond & Efstathio3 
19841: iPolnarevi ,1985c .Bond fc Efstathiou ,1987t 
Harari fc Zaldarriagalll993l ). This produces a correlation 
between the temperatur e pat tern and t he polar ization 
pattern (jCoulson ct al.. 11994 [Crittenden et al.l [1995.) . 
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TABLE 1 

Summary of the cosmological parameters of ACDM model" 



Class 


Parameter 


WMAP 7-yoar ML'' 


WMAP+BAO+Ho ML 


WMAP 7-ycar Mean'' 


WAfAP+BAO+ffo Mean 


Primary 


Ua 
ris 
r 


2.227 
0.1116 
0.729 
0.966 
0.085 
2.42 X 10"^ 


2.253 
0.1122 
0.728 
0.967 
0.085 
2.42 X lO"'' 


2 249+0-056 

^■^^''-0.057 

0.1120 ± 0.0056 

-0.029 

0.967 ± 0.014 
0.088 ± 0.015 
(2.43 ± 0.11) X lO"'' 


2.255 ± 0.054 
0.1126 ± 0.0036 
0.725 ± 0.016 
0.968 ± 0.012 
0.088 ± 0.014 
(2.430 ± 0.091) X lO"*" 


Derived 


Ho 
to' 


0.809 
70.3 km/s/Mpc 
0.0451 
0.226 
0.1338 
10.4 
13.79 Gyr 


0.810 
70.4 km/s/Mpc 
0.0455 
0.226 
0.1347 
10.3 
13.76 Gyr 


'J-**-'- -'--0.031 

70.4 ± 2.5 km/s/Mpc 
0.0455 ± 0.0028 
0.228 ± 0.027 

10.6 ± 1.2 
13.77 ± 0.13 Gyr 


0.816 ± 0.024 
70.2 ± 1.4 km/s/Mpc 
0.0458 ± 0.0016 

0.229 ± 0.015 
0.1352 ± 0.0036 

10.6 ± 1.2 
13.76 ± 0.11 Gyr 



The parameters listed here are derived using tire RECFAST 1.5 and version 4.1 of the WMAP likelihood code. All 
the other parameters in the other tables are derived using the RECFAST 1.4.2 and version 4.0 of the WMAP likelihood 
c ode , unless stated otherwise. The difference is smalL Sec Appendix^pfor comparison. 
^ ILar son et al. (2010). "ML" refers to the Maximum Likelihood parameters. 

" ILarson et al., i2010,) . "Mean" refers to the mean of the posterior distribution of each parameter. The quoted errors 
show the 68% confidence levels (CL). 

A^(fe) = fc^P-R(fc)/(27r^) and feo = 0.002 Mpc"\ 
" "Redshift of rcionization," if the universe was reionizcd instantaneously from the neutral state to the fully ionized 
state at jSrcion- Note that these values are somewhat different from those in Table 1 of lKomatsu et"aT l|2009aD, largely 
because of the changes in the treatment of rcionization history in the Boltzmann code CAMB ^e^& [2Q0^) T^ 
^ The present-day age of the universe. 



TABLE 2 

Summary of the 95% confidence limits on deviations from the simple (flat, Gaussian, adiabatic, power-law) ACDM model except for dark energy 

parameters 



Section 



Name 



Case 



WAfAP 7-year 



WMAP-fBAO-t-SN" 



WMAP-l-BAO-fifo 



Section 



Sectionl4.2 



Section 
Section 



4.1 



Section l4.5l 
Section l4?6l 

Section lin 
Sectionf6l 



Grav. Wave^ 
Running Index 
Curvature 
Adiabatieity 

Parity Violation 
Neutrino Mass^ 



Rclativistic Species 
Gaussianity^ 



No Running Ind. 
No Grav. Wave 
w — —1 
Axion 
Curvaton 
Chern-Simons'^ 
w — —1 
w =^ -1 
w — —1 
Local 
Equilateral 
Orthogonal 



r < 0.36° 
-0.084 < dn^/dlnk < 0.020° 
N/A 
ao < 0.13" 
a_i < 0.011° 
-5.0° < Aa < 2.8°° 
X;™.. < 1.3 eV° 
J2™.^ < 1-4 eV° 
N^H > 2.7° 
-10 < /J^°"' < 74" 



-214 < /° 



< 266 



-410 < /"'''°« < 6 



r < 0.20 
-0.065 < dnjdlnk < 0.010 
-0.0178 < Ofc < 0.0063 
ao < 0.064 
a_i < 0.0037 
N/A 

< 0.71 eV 
J]m„ < 0.91 eV 
N/A 
N/A 
N/A 
N/A 



r < 0.24 
-0.061 < dnjdlnk < 0.017 
-0.0133 < flfc < 0.0084 
ao < 0.077 
Q_i < 0.0047 
N/A 
J2 < 0.58 eVB 
J2™,u < 1.3 eV'' 
■i-3-itoH (68% CL)' 
' N/A 
N/A 
N/A 



" "SN" denotes the " Constitution" sample of Type la supernovae compiled by IHicken et all ]2009b|l, which is a n exten sion of the "Union" sample 
]Kowalski et ani2008D that we used for the 5-year " WA/fAP-t-BAO-t-SN" parameters presented in ^omatsu et al.l j2009al) . Systematic errors in the 
supernova data are not included. While the parameters in this column can be compared directly to the 5-year WAfAP-t-BAO-t-SN parameters, they may 
not be as robust as the "W A<fAP-j-BAO-|-ifo" parameters, as the other compilations of the supernova data do not give the same answers (Hicken et a^ 
l2009bl : [Kessler et al.ll200g| ). See Section [3T01 for more discussion. The SN data will be used to put limits on dark energy properties. See Section [5] and 
Table H 

^ In the form of the tensor-to-scalar ratio, r, at — 0.002 Mpc~^. 
° ILarson et al.l 1 12010) . 

*^ For an interaction of the form given by [ip(t) / A4]Faj3 F"^^ ^ the polarization rotation angle is Aa — A/^' / —0. 

° The 68% CL limit is Aa — —1.1° ± 1.4° (stat.) it 1.5° (syst.), where the first error is statistical and the second error is systematic. 

' = 94(n^h^) eV. 

« For WAfAP-l-LRG-l-ffo, E ™^ < 0.44 eV. 

'' For WlfAP-f LRG-l-J/o , E ™^ < 0-71 eV. 

' The 95% limit is 2.7 < Afeft < 6.2. For WAfAP-|-LRG-l-ffo, N^ii = 4.25 ± 0.80 (68%) and 2.8 < N^tt < 5.9 (95%). 

^ VH-W map masked by the KQ75y7 mask. The Galactic foreground tem plates are margina lized over. 

^ When combined with the limit on /JjJ"' from SDSS, -29 < < 70 l lSlosar et al.ll200a) , we find -5 < /JjJ^"' < 59. 



Different mechanisms for generating fluctuations pro- 
duce distinctive correlated patterns in temperature and 
polarization; 

1. Adiabatic scalar fluctuations predict a radial po- 
larization pattern around temperature cold spots 
and a tangential pattern around temperature hot 
spots on angular scales greater than the horizon 
size at the decoupling epoch, > 2°. On angular 



scales smaller than the sound horizon size at the 
decoupling epoch, both radial and tangential pat- 
terns are formed around both hot and cold spots, 
as the acoustic oscillatio n of the CMB modu lates 
the polarization pattern (jCoulson et aLllTOOl) . As 
we have not seen any evidence fo r non-adiabatic 
fluctuations ([Komatsu et al.|[2009al see Section 
for the 7-year limits), in this section we shall as- 
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TABLE 3 
Primordial helium Abundance" 



Yj, < 0.51 (95% CL) Yp = 0.326 ± 0.075 (68% CL)*" 

bee Section lTsl 
The 95% CL limit is 0.16 < Yp < 0.46. For 
WMAP+ ACBAR+QUaD+LRG+ffo, Yh^ = 0.349 ± 
0.064 (68% CL) and 0.20 < Yp < 0.46 (95% CL). 

sume that fluctuations are purely adiabatic. 

2. Tensor fluctuations predict the opposite pat- 
tern: the temperature cold spots are surrounded 
by a tangential polarization pattern, while the 
hot spots are su rrounded by a radial pattern 
(jCrittenden et al l 1995). Since there is no acoustic 
oscillation for tensor modes, there is no modula- 
tion of polarization patterns around temperature 
spots on small angular scales. We do not expect 
this contribution to be visible in the WMAP data, 
given the noise level. 

3. Defect models predict that there should be minimal 
correlations between temperature and polarization 
on 2° < 6* < 10° (Scljak et al. 1997). The detec- 
tion of large-scale temperature polarization fluc- 
tuations rules out any causal models as the pri- 
mary n iechanism for generating th e CMB fluctu- 
ations fSpergel fc Zaldarriagalll997f) . This implies 
that the fluctuations were either generated during 
an accelerating phase in the early universe or were 
present at the time of the initial singularity. 

This section presents the first direct measurement of 
the predicted pattern of adiabatic scalar fluctuations in 
CMB polarization maps. We stack maps of Stokes Q and 
U around temperature hot and cold spots to show the ex- 
pected polarization pattern at the statistical significance 
level of 8(7. While w e have detected the TE correlations 
in the first year data (jKogut et al.ll2003l ). we present here 
the direct real space pattern around hot and cold spots. 
In Section 12.51 we discuss the relationship between the 
two measurements. 

2.2. Measuring Peak-Polarization Correlation 

We first identify temperature hot (or cold) spots, and 
then stack the polarization data (i.e.. Stokes Q and U) 
on the locations of the spots. As we shall show below, 
the resulting polarization data is equivalent to the tem- 
perature peafc-polarization correlation function which is 
similar to, but different in an important way from, the 
temperature-polarization correlation function. 

2.2.1. Qr and Ur'- Transformed Stokes Parameters 

Our defin i tions of Stokes Q and U follow that of 
iKogut et al.l (|2003f ): the polarization that is parallel to 
the Galactic meridian is Q > and ?7 = 0. Starting from 
this, the polarization that is rotated by 45° from east to 
west (clockwise, as seen by an observer on Earth looking 
up at the sky) has Q — Q and C/ > 0, that perpendicular 
to the Galactic meridian has Q < and C/ = 0, and that 
rotated further by 45° from east to west has (5 = and 
t/ < 0. With one more rotation we go back to Q > and 
[7 = 0. We show this in Figure [TJ 



* Q<0,U=0 




Fig. 1. — Coordinate system for Stokes Q and U . We use Galactic 
coordinates with north up and east left. In this example, Qr is 
always negative, and Up is always zero. When Qr > and Ur = 0, 
the polarization pattern is radial. 



As the predicted polarization pattern around temper- 
ature spots is either radial or tangential, we find it most 
convenient to wo rk with^Q^ and Up first introduced by 
iKamionkowski et all (|199^ : 

Qr{0) = -Q{e) cos(20) - U{e) sin(20), (1) 
Up{e) = Q{e) sm{2(f)) - U{e) cos(2<?!)). (2) 

These transformed Stokes parameters are defined with 
respect to the new coordinate system that is rotated by 
(j), and thus they are defined with respect to the line con- 
necting the temperature spot at the center of the coordi- 
nate and the polarization at an angular distance 9 from 
the center (also see Figure [T]) . Note that we have used 
the small-angle (flat-sky) approximation for simplicity of 
the algebra. This approximation is justified as we are 
interested in relatively small angular scales, < 5°. 

The above definition of Qr is equivalent to the so-called 
"tangential shear" statistic used by the weak gravita- 
tional lensing community. By following what has been 
already done for the tangential shear, we can find the 
necessary formulae for Qp and Specifically, w e shall 
follow the derivations given in iJeong et al.l (|2009l ). 

With the small-angle approximation, Q and U are re- 
lated to the E- and i?-mode polarization in Fourier space 
(jSeliak fc Zaldarriagalll997t IKamionkowski et al.l I1997D 
as 

- Q{0) = j [El cos(2(p) - B, sin(2^)] e^' ^ (3) 

-U{e) = j -0^ [E, sin(2^) + B, cos(2^)] e^''^ (4) 

where (/? is the angle between 1 and the line of Galac- 
tic latitude, 1 = (/cos(p,/sin<p). Note that we have in- 
cluded the negative signs on the left hand side because 
our sign convention for the Stokes parameters is oppo- 
site of that used in equation (38) of .Zaldarriaga fc SeliaH 



WMAP only 



WMAP+ACBAR+QUaD 




WMAP 7-year Cosmological Interpretation 



5 



TABLE 4 

Summary of the 68% limits on dark energy properties from WMAP combined with other 

DATA sets 



Section 


Curvature 


Parameter 


+BAO+//0 


+BAO+ffo+£'At" 


+BAO+SN'' 


Scction|5.1| 
Section f5?2] 


fifc - 
/ 


Constant w 
Constant w 


-1.10 ± 0.14 
-1.44 ± 0.27 


-1.08 ± 0.13 
-1.39 ± 0.25 
n ni 1 1 +0.0060 

^•"-^-^-^-0.0063 


-0.980 ± 0.053 

^■■^^^-0.056 










+BAO+H0+SN 


+BAO+Ho+DAt+SN 


SectiQn|5.3l 


- 


Wo 
Wa 


-0.83 ± 0.16 
-0.80_o 33 


-0.93 ± 0.13 
-0 41+°-''^ 


-0.93 ± 0.12 

-o.38lS:«l 



^ "-DAt" denotes the time-delay distance to the lens system B1608+656 at z — 0.63 measured 
bv lSuvu et al.1 <|2010H . See Section f3?T5] for details. 

^ "SN" denotes the "Constitution" sample of Type l a supernovae compiled by iHicken et ahl 
l |2009bD . which is an extension of the "Union" sample ||KQwaJskj_et_ar|[20 0^) tha t we used for 
the 5-year " WMAP-|-BAO-|-SN" parameters presented iii TKornatsi^^an ' 72009an Systematic 
errors in the supernova data are not included. 



(|1997[) . The transformed Stokes parameters are given by 



Qr{0) 



{Sicos[2(0-(^)] 



(5) 



-Bicos[2(</.- v')]}e''-^. (6) 

The stacking of Qr and Ur at the locations of temper- 
ature peaks can be written as 

{Qr){0) = / d^nAf (n)(npk(n)Q,(n + 9)}, (7) 
{Ur)ie)^^ [ (fhMin)Mh)Ur{n + e)), (8) 

JVpk J 

where the angle bracket, (...), denotes the average over 
the locations of peaks, npk(n) is the surface number den- 
sity of peaks (of the temperature fluctuation) at the lo- 
cation n, A^pk is the total number of temperature peaks 
used in the stacking analysis, and M(n) is equal to at 
the masked pixels and 1 otherwise. Defining the density 
contrast of peaks, 5pk = rtpk/"-pk — 1, we find 

{Qr){e) = ^ [ ^M{h){d^^{h)Qr{h + 9)), (9) 
/sky J 47r 

{Ur)i9) = ^ I ^M(n)(5pk(n)C/,(n + ^)),(10) 
/sky J 47r 

where /sky = / M{n)d?hl [A-k) is the fraction of sky out- 
side of the mask, and we have used iVpk = 47r/sky7tpk- 

In Appendix |B1 we use the statistics of peaks of Gaus- 
sian random fields to relate {Qr) to the temperature- i?- 
mode polarization cross power spectrum C™, {Ur) to 
the temperature- i3-mode polarization cross power spec- 
trum C™, and the stacked temperature profile, (T), to 
the temperature power spectrum Cj"^ . We find 

{Qr){e)^- j ^^wrwf{K + b,f)crJ2{ie),{ii) 

{T)ie)^ I ^-^{wn\K+ki')crJom, (13) 



where Wj^ and Wf are the harmonic transform of win- 
dow functions, which are a combination of the experi- 
mental beam, pixel window, and any other additional 
smoothing applied to the temperature and polarization 
data, respectively, and by + brP is t he "s cale-dependent 
bias" of peaks found by iDesiacquesI (|2008l ) averaged over 
peaks. See Appendix IB] for details. 

2.2.2. Prediction and Physical Interpretation 

What do {Qr){d) and {Ur){9) look like? The Qr map is 
expected to be non-zero for a cosmological signal, while 
the Ur map is expected to vanish in a parity-conserving 
universe unless some systematic error rotates the polar- 
ization plane uniformly. 

To understand the shape of Qr as well as its physi- 
cal implications, let us begin by showing the smoothed 
C™ spectra and the corresponding temperature-Qr cor- 
relation functions, C'^'^''{6), in Figure [2] (Note that 
C^Q^ and C^^- can be computed from equations Ijlip 
and (|12p . respectively, with b^ = 1 and b^ = 0.) This 
sho ws three distinct effects causing polarization of CMB 
(see lHu fc~ White 1997^ for a pedagogical review): 

(a) > 20horizon, where ^horizon is the angular size 
of the radius of the horizon size at the decou- 
pling epoch. Using the comoving horizon size 
of Thorizon = 0.286 Gpc and the comoving angu- 
lar diameter distance to the decoupling epoch of 
dA — 14 Gpc as derived from the WMAP data, we 
find 6'horizon — 1-2°. As this scale is so much greater 
than the sound horizon size (see below), only grav- 
ity affects the physics. Suppose that there is a New- 
tonian gravitational potential, $Ar, at the center of 
a perturbation, 6 — Q. If it is overdense at the cen- 
ter, < 0, and thus it is a cold s pot according 
to the Sachs-Wolfe formula (iSachs fc Wolfe 1967), 
AT/T = $Ar/3 < 0. The photon fluid in this re- 
gion will flow into the gravitational potential well, 
creating a converging flow. Such a flow creates the 
quadrupole temperature anisotropy around an elec- 
tron ai 9 > 26'horizon, producing polarization that is 
radial, i.e., Qr > 0. Since the temperature is neg- 
ative, we obtain {TQ r) < 0, i.e., anti-correlation 
(jCoulson et al.l Il994f) . On the other hand, if it 
is overdense at the center, then the photon fluid 
moves outward, producing polarization that is tan- 
gential, i.e., Qr < 0- Since the temperature is pos- 
itive, we obtain {TQr) < 0, i.e., anti-correlation. 
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Fig. 2. — Temperature-polarization cross correlation with various smoothing functions. (Left) The TE power spectrum with no smoothing 
is shown in the black solid line. For the other curves, the temperature is always smoothed with a 0.5° (FWHM) Gaussian, whereas the 
polarization is smoothed with either the same Gaussian (black dashed), Q-band beam (blue solid), V-band beam (purple solid), or W-band 
beam (red dashed). (Right) The corresponding spatial temperature-Qr correlation functions. The vertical dotted lines indicate (from left 
to right): the acoustic scale, 2xthe acoustic scale, and 2xthe horizon size, all evaluated at the decoupling epoch. 




e 

Fig. 3. — Predicted te mper ature peak-polarization cross correlation, as measured by the stacked profile of the transformed Stokes Qr, 
computed from equation Ullj l for various values of the threshold peak heights. The temperature is always smoothed with a 0.5° (FWHM) 
Gaussian, whereas the polarization is smoothed with either the same Gaussian (black dashed), Q-band beam (blue solid), V-band beam 
(purple solid), or W-band beam (red dashed). (Top left) All temperature hot spots are stacked. (Top right) Spots greater than la are 
stacked. (Bottom left) Spots greater than 2cr are stacked. (Bottom right) Spots greater than 3cr are stacked. The light gray lines show the 
average of the measurements from noiseless simulations with a Gaussian smoothing of 0.5° FWHM. The agreement is excellent. 



The anti-correlation at ^ > 20horizon is a smoking- 
gun for the presence of sup er-horizon f luctuations 
at th e decoupling epoch (jS per gel & Zaldarriag3 
I1997D, which has been confirmed by the WMAP 
data (jPeiris et al.ll2003[ ). 

(b) 9 ~ 29a, where 9a is the angular size of the ra- 
dius of the sound horizon size at the decoupling 
epoch. Using the comoving sound horizon size of 
= 0.147 Gpc and = 14 Gpc as derived from 



the WMAP data, we find 9a = 0.6°. Again, con- 
sider a potential well with $Ar < at the center. 
As the photon fluid fiows into the well, it com- 
presses, increasing the temperature of the photons. 
Whether or not this increase can reverse the sign of 
the temperature fiuctuation (from negative to pos- 
itive) depends on whether the initial perturbation 
was adiabatic. If it was adiabatic, then the tem- 
perature would reverse sign at 9 < 20hoiizon- Note 




e 



e 



Fi g. 4 . — Predicted temperature peak-temperature correlation, as measured by the stacked temperature profile, computed from equa- 
tion H13| l for various values of the threshold peak heights. The choices of the smoothing functions and the threshold peak heights are the 
same as in Figure [3] 



that the photon fluid is still flowing in, and thus the 
polarization direction is radial, Qr > 0. However, 
now that the temperature is positive, the correla- 
tion reverses sign: (TQr) > 0. A similar argument 
(with the opposite sign) can be used to show the 
same result, (TQr) > 0, for $Ar > at the cen- 
ter. As an aside, the temperature reverses sign on 
smaller angular scales for isocurvature fluctuations. 

(c) 6 c:i 6a- Again, consider a potential well with 
<i>Ar < at the center. At 9 < 20 a, the pressure of 
the photon fluid is so great that it can slow down 
the flow of the fluid. Eventually, at 6* ~ 6*^, the 
pressure becomes large enough to reverse the di- 
rection of the flow (i.e., the photon fluid expands). 
As a result the polarization direction becomes tan- 
gential, Qr < 0; however, as the temperature is 
still positive, the correlation reverses sign again: 

{TQr) < 0. 

On even smalle r scales, the correlati on reverses sign again 
(see Figure 2 of lCoulson et al.lll994l ) because the temper- 
ature gets too cold due to expansion. We do not see this 
effect in Figure [5] because of the smoothing. Lastly, there 
is no correlation between T and Qr a,t — because of 
symmetry. 

These features are essentially preserved in the peak- 
polarization correlation as measured by the stacked po- 
larization profiles. We show them in Figure [3] for various 
values of the threshold peak heights. The important dif- 
ference is that, thanks to the scale-dependent bias oc P, 
the small-scale trough at ~ 6'a is enhanced, making 
it easier to observe. On the other hand, the large-scale 



anti-correlation is suppressed. We can therefore conclude 
that, with the WMAP data, we should be able to mea- 
sure the compression phase at ~ 20a = 1.2°, as well 
as the reversal phase at ~ 6*^ = 0.6°. We also show 
the profiles calculated from numerical simulations (gray 
solid lines). The agreement with equation (1111) is excel- 
lent. We also show the predicted profiles of the stacked 
temperature data in Figure U] 

2.3. Analysis Method 
2.3.1. Temperature Data 

We use the foreground-reduced V-l-W temperature 
map at the HEALPix resolution of A'sido = 512 to find 
temperature peaks. First, we smooth the foreground- 
reduced temperature maps in 6 differencing assemblies 
(DAs) (VI, V2, Wl, W2, W3, W4) to a common resolu- 
tion of 0.5° (FWHM) using 



Ar(n) 



Irn 



air 



, , ^ Ir 

hi 



.(ft), 



(14) 



where hi is the appropriate beam transfer function for 
each DA (jJarosik et al.ll2010[ ). and Wf = p;exp[-/(Z + 
l)(Tp^jjj^/(161n2)] is the pixel window function for 
A'sidc = 512, p;, times the spherical harmonic transform 
of a Gaussian with ctfwhm — 0.5°. We then coadd the 
foreground-reduced V- and W-band maps with the in- 
verse noise variance weighting, and remove the monopole 
from the region outside of the mask (which is already 
negligibly small, 1.07 x 10"'' /^K). For the mask, we com- 
bine the new 7-year KQ85 mask, KQ85y7 (defined in 
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iGold et al]|2010l also see Section and P06 masks, 
leaving 68.7% of the sky available for the analysis. 

We find the locations of minima and maxima us- 
ing the software " hotspot" in the HEALPix package 
(|Gorski et all I2005D . Over the fuU sky (without the 
mask), we find 20953 maxima and 20974 minima. As 
the maxima and minima found by hotspot still contain 
negative and positive peaks, respectively, we further se- 
lect the "hot spots" by removing all negative peaks from 
maxima, and the "cold spots" by removing all positive 
peaks from minima. This procedure corresponds to set- 
ting the threshold peak height to vt = 0; thus, our pre- 
diction for {Qr){S) is the top-left panel of Figure |31 

Outside of the mask, we find 12387 hot spots and 
12628 cold spots. The r.m.s. temperature fluctuation 
is (To = 83.9 ^K. What does the theory predict? Us- 
ing equation (IB15|) with the power spectrum Cj-"^ = 
(^TT.sig„ai^2 + Nrm)eM-Kl + l)a|wHM/(81n2)] 
where N'f'^ — 7.47 x 10 /iK^ sr is the noise bias of 
the V-l-W map before Gaussian smoothing and c'^tt, signal 
is the 5-year best-fitting power-law ACDM temperature 
power spectrum, we find 47r/skyfipk = 12330 for vt — 
and /sky = 0.687; thus, the number of observed hot and 
cold spots is consistent with the predicted number 

2.3.2. Polarization Data 

As for the polarization data, we use the raw (i.e., with- 
out foreground cleaning) polarization maps in V and W 
bands. We have checked that the cleaned maps give 
similar results with slightly larger error bars, which is 
consistent with the excess noise introduced by the tem- 
plate foreground cleaning procedure (Page et al. 2007; 
[Gold et al.l 120091 12010|). As we are focused on relatively 
small angular scales, 9 < 2°, in this analysis, the re- 
sults presented in this section would not be affected by 
a potential systematic effect causing an excess power in 
the W-band polarization data on large angular scales, 
/ < 10. However, note tha t this excess p ower could just 
be a statistical fluctuation (jJarosik et al.l HolO'). We form 
two sets of the data: (i) V, W, and V-l-W band maps 
smoothed to a common resolution of 0.5°, and (ii) V, W, 
and V-l-W band maps without any additional smoothing. 
The first set is used only for visualization, whereas the 
second set is used for the analysis. 

We extract a square region of 5° x 5° around each tem- 
perature hot or cold spot. We then coadd the extracted 
T images with uniform weighting, and Q and U images 
with the inverse noise variance weighting. We have elim- 
inated the pixels masked by KQ85y7 and P06 from each 
5° X 5° region when we coadd images, and thus the re- 
sulting stacked image has the smallest noise at the center 
(because the masked pixels usually appear near the edge 
of each image). We also accumulate the inverse noise 
variance per pixel as we coadd Q and U maps. The coad- 
ded inverse noise variance maps of Q and U will be used 
to estimate the errors of the stacked images of Q and U 
per pixel, which will then be used for the analysis. 

We find that the stacked images of Q and U have con- 
stant offsets, which is not surprising. Since these affect 

^® Note that the predicted number is 47r/skySpk = 10549 if we 
ignore the noise bias; thus, even with a Gaussian smoothing, the 
contribution from noise is not negligible. 



TABLE 5 

Statistics of the results from the stacked 
polarization analysis 



Data Combination^ 




best-fitting Amplitude'^ 




Hot, Q, V+W 
Hot, U, V+W 
Hot, Qr, V+W 
Hot 71 V-l-W 


661.9 
661.1 
694.2 
629 2 


0.57 ± 0.21 
1.07 ± 0.21 
0.82 ± 0.15 

— U.J-O m 'J.±u 


-7.3 
-24.7 
-29.2 
— 18 


Cold, Q, V+W 
Cold, U. V+W 
Cold, Qr, V+W 
Cold, Ur, V+W 


668.3 
682.7 
682.2 
657.8 


0.89 ± 0.21 
0.86 ± 0.21 
0.90 ± 0.15 
0.20 ± 0.15 


-18.2 
-16.7 
-36.2 
-0.46 


Hot, Q, V-W 
Hot, U, V-W 
Hot, Qr, V-W 
Hot, Ur, V-W 


559.8 
629.8 
662.2 
567.0 






Cold, Q, V-W 
Cold, U, V-W 
Cold, Qr, V-W 
Cold, Ur. V-W 


584.0 
668.2 
616.0 
636.9 







^ "Hot" and "Cold"' denote the stacking around tempera- 
ture hot spots and cold spots, respectively. 
^ Computed with respect to zero signal. The number of 
degrees of freedom (DOF) is 25^ = 625. 

^ Best-fitting amplitudes for the corresponding theoretical 
predictions. The quoted errors show the 68% confidence 
level. Note that, for Ur, we used the prediction for Qr', 
thus, the fitted amplitude may be interpreted as sin(2Aa), 
where Aa is the rotation of the polarization plane due to, 
e.g., violation of global parity symmetry. 

Difference between the second column and after re- 
moving the model with the best-fitting amplitude given in 
the 3rd column. 

our determination of polarization directions, we remove 
monopoles from the stacked images of Q and U. The 
size of each pixel in the stacked image is 0.2°, and the 
number of pixels is 25^ = 625. 

Finally, we compute Qr and Ur from the stacked im- 
ages of Stokes Q and U using equations H]) and ([2]) , re- 
spectively. 

2.4. Results 

In Figure [5] and [6l we show the stacked images of T, 
Q, U, Qr, and Ur around temperature cold spots and 
hot spots, respectively. The peak values of the stacked 
temperature profiles agree with the predictions (see the 
dashed line in the top- left panel of Figure S]) . A dip in 
temperature (for hot spots; a bump for cold spots) at 
61 ~ 1° is clearly visible in the data. While the Stokes 
Q and U measured from the data exhibit the expected 
features, they are still fairly noisy. The most striking im- 
ages are the stacked Qr (and T). The predicted features 
are clearly visible, particularly the compression phase at 
1.2° and the reversal phase at 0.6° in Q,.: the polariza- 
tion directions around temperature cold spots are radial 
at ~ 0.6° and tangential at ~ 1.2°, and those around 
temperature hot spots show the opposite patterns, as 
predicted. 

How significant are these features? Before performing 
the quantitative analysis, we first compare Qr and Ur 
using both the (V+W)/2 sum map (here, V-l-W refers 
to the inverse noise variance weighted average) as well as 
the (V— W)/2 difference map (bottom panels of Figure [5] 
and [6|) . The Qr map (which is expected to be non-zero 
for a cosmological signal) shows clear differences between 
the sum and difference maps, while the Ur map (which 
is expected to vanish in a parity-conserving universe un- 
less some systematic error rotates the polarization plane 
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Fig. 5. — Stacked images of temperature and polarization data around temperature cold spots. Each panel shows a 5° X 5° region with 
north up and east left. Both the temperature and polarization data have been smoothed to a common resolution of 0.5°. (Top) Simulated 
images with no instrumental noise. From left to right: the stacked temperature, Stokes Q, Stokes U , and transformed Stokes Qr (see 
equation |[TJ) overlaid with the polarization directions. (Middle) WMAP 7-year V+W data. In the observed map of Qr, the compression 
phase at 1.2° and the reversal phase at 0.6° are clearly visible. (Bottom) Null tests. From left to right: the stacked Qr from the sum 
map and from the difference map (V— W)/2, the stacked Ur from the sum map and fr om t he difference map. The latter three maps are 
all consistent with noise. Note that Ur, which probes the TB correlation (sec equation (I12l l). is expected to vanish in a parity-conserving 
universe. 
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uniformly) is consistent with zero in both the sum and 
difference maps. 

Next, we perform the standard analysis. We sum- 
marize the results in Table \E\ We report the values of 
measured with respect to zero signal in the second 
column, where the number of degrees of freedom (DOF) 
is 625. For each sum map combination, we fit the data 
to the predicted signal to find the best-fitting amplitude. 

The largest improvement in is observed for Qr, as 
expected from the visual inspection of Figure [5] and [6l 
we find 0.82 ± 0.15 and 0.90 ± 0.15 for the stacking of Qr 
around hot and cold spots, respectively. The improve- 
ment in is = —29.2 and —36.2, respectively; thus, 
we detect the expected polarization patterns around hot 
and cold spots at the level of 5Aa and 6a, respectively. 
The combined significance exceeds 8a. 

On the other hand, we do not find any evidence for 
Ur- The values with respect to zero signal per DOF 
are 629.2/625 (hot spots) and 657.8/625 (cold spots), 
and the probabilities of finding larger values of x^ are 
44.5% and 18%, respectively. But, can we learn anything 
about cosmology from this result? While the standard 
model predicts C™ — and hence {Ur) = 0, models 
in which the global parity symmetry is violated ca.n cre- 
ate = sin(2Aa)C™ (|Lue et al.lll999l : ICarrollllToM 
iFeng et al.l [20051 ). Therefore, we fit the measured Ur 
to the predicted Qr, finding a null result: sin(2Aa) = 
-0.13 ± 0.15 and 0.20 ± 0.15 (68% CL), or equivalently 
Aa = -3.7° ± 4.3° and 5.7° ± 4.3° (68% CL) for hot 
and cold spots, respectively. Averaging these numbers, 
we obtain Aa = 1.0° ± 3.0° (68% CL), which is consis- 
tent with (although not as stringent as) the limit we find 
from the full analysis presented in Section |4?5] Finally, 
all the x^ values measured from the difference maps are 
consistent with a null signal. 

How do these results compare to the full analysis of 
the TE power spectrum? By fitting the 7- year Ci data 
to the same power spectrum used above (5-year best- 
fitting power-law ACDM model from I = 24 to 800, 
i.e., DOF=777), we find the best- fitting amplitude of 
0.999 ±0.048 and Ax^ = -434.5, i.e., a 21cr detection of 
the TE signal. This is reasonable, as we used only the 
V- and W-band data for the stacking analysis, while we 
used also the Q-band data for measuring the TE power 
spectrum; (Qr) [0) is insensitive to information on > 2° 
(see top left panel of Figure [S]) ; and the smoothing sup- 
presses the power at I > 400 (see left panel of Figure [5]). 
Nevertheless, there is probably a way to extract more in- 
formation from {Qr){0) by, for example, combining data 
at different threshold peak heights and smoothing scales. 

2.5. Discussion 

If the temperature fluctuations of the CMB obey Gaus- 
sian statistics and global parity symmetry is respected on 
cosmological scales, the temperature-iJ-mode polariza- 
tion cross power spectrum, C^^, contains all the infor- 
mation about the temperature-polarization correlation. 
In this sense, the stacked polarization images do not add 
any new information. 

The detection and measurement of the temperature- 
E m ode polarization cross-correlation power spectrum, 
C;^ (|Kovac et al.l[2002l: IKoeut et al.l[2003 l 'Spergel_eLaLl 
[2003|) . can be regarded as equivalent to finding the pre- 



TABLE 6 

Polarization Data: Improvements from the 5- year data 



/ Range 


Type 


7-year 


5-year 


High r 


TE 
TB 


Detected at 21ct 
Aq = -0.9° ± 1.4° 


Detected at 13cr 
Aq = -1.2° ± 2.2° 


Low l" 


EE 

BB 
EE/BB 
TB/EB 


T = 0.088 ± 0.015 
r < 2.1 (95% CL) 
r < 1.6 (95% CL) 
Aa = -3.8° ± 5.2° 


T = 0.087 ± 0.017 
r < 4.7 (95% CL) 
r < 2.7 (95% CL) 
Aq = -7.5° ± 7.3° 


All / 


TE/EE/BB 
TB/EB"^ 


r < 0.93 (95% CL) 
Aq = -1.1° ± 1.4° 


r < 1.6 (95% CL) 
Aq = -1.7° ± 2.1° 



° ; > 24. The Q-, V-, and W-band data are used for the 7-ycar 
analysis, whereas only the Q- and V-band data were used for 
the 5-year analysis. 

^ 2 < I < 23. The Ka-, Q-, and V-band data arc used for both 
the 7-year and 5-year analyses. 

° The quoted errors are statistical only, and do not include the 
systematic error ±1.5° (see Section l4.5l >. 

dieted polarization patterns around hot and cold spots. 
While we have shown that one can write the stacked po- 
larization profile around temperature spots in terms of an 
integral of C™ , the formal equivalence between this new 
method and C'f^ is valid only when temperature fluctu- 
ations obey Gaussian statistics, as the stacked Q and 
U maps measure correlations between temperature peaks 
and polarization. So far there is no convincing evidence 
for non-Gaussianity in the temperature fl uctuations ob- 
served by WMAP IjKomatsu et al.l 120031 see Section M 
for the 7-year limits on primordial non-Gaussianity, and 
Bennett et al. 2010 for discussion on other non-Gaussian 
features). 

Nevertheless, they provide striking confirmation of our 
understanding of the physics at the decoupling epoch in 
the form of radial and tangential polarization patterns 
at two characteristic angular scales that are important 
for the physics of acoustic oscillation: the compression 
phase a.t 9 = 29a and the reversal phase at 9 = 9a- 

Also, this analysis does not require any analysis in 
harmonic space, nor decomposition to E and B modes. 
The analysis is so straightforward and intuitive that the 
method presented here would also be useful for null tests 
and systematic error checks. The stacked image of Ur 
should be particularly useful for systematic error checks. 

Any experiments that measure both temperature and 
polarization should be able to produce the stacked images 
such as presented in Figure [5] and [6l 

3. SUMMARY OF 7- YEAR PARAMETER ESTIMATION 
3.1. Improvements from the 5-year Analysis 

Foreground Mask. The 7-year temperature analy- 
sis masks, KQ85y7 and KQ75y7, have been slightly en- 
larged to mask the regions that have excess foreground 
emission, particularly in the HII regions Gum and Ophi- 
uchus, identified in the difference between foreground- 
reduced maps at different frequencies. As a result, the 
new KQ85y7 and KQ75y 7 masks eliminate an additional 
3.4% and 1.0% of the sky, leaving 78.27% and 70.61% of 
the sky for th e cosmolog ical analyses, respectively. See 
Section 2.1 of iGold et all (j20Tol ) for details. There is no 
change in the po larization P06 mask (see Section 4.2 of 
iPage et aLll2007l for definition of this mask) , which leaves 
73.28% of the sky. 

Point Sources and the SZ Effect. We continue 
to marginalize over a contribution from unresolved point 
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sources, assuming that the antenna temperature of point 
sources de chnes with freque ncy as 1^-2.09 ^ggg equa- 
tion (5) of iNolta et~aIll2009D . The 5-year estimate of 
the power spectrum from unresolved point sources in Q 
band in units of antenna temperature, Apg, was lO'^Apg — 
11 ± 1 /iK^ sr (jNolta et aLll2009( ). and we used this value 
and the error bar to marginalize over the power spectrum 
of residual point sources in the 7-year parameter estima- 
tion. The subsequent analysis showed that the 7-year es- 
timate of t he power spectrum is lO'^^ps = 9.0±0.7 /iK^ sr 
([Larson et a l. 2010), which is somewhat lower than the 5- 
year value because more sources are resolved by WMAP 
and included in the source mask. The difference in the 
diffuse mask (between KQ85y5 and KQ85yT) does not af- 
fect the value of A-ps very much: we find 9.3 instead of 9.0 
if we use the 5-year diffuse mask and the 7-year source 
mask. The source power spectrum is sub-dominant in 
the total power. We have checked that the parameter re- 
sults are insensitive to the difference between the 5-year 
and 7- year residual source estimates. 

We continue to marginalize over a contribution from 
the SZ effect using the same t emplate as f or the 3- 
and 5-year analyses (jKomatsu fc Scljak 20021 ). We as- 
sume a uniform prior on the amplitude of this template 
as < Asz < 2, which is now justified by the latest 
limits fro m the SPT collabo ration, Asz = 0.37 ± 0.17 
(68% CL: iLueker eTalllMcl. and the ACT collabora- 
tion, Agz < 1.63 (95% CLn^wEEeEaDPOlO). 

High-/ Temperature and Polarization. We in- 
crease the multipole range of the power spectra used for 
the cosmological parameter estimation from 2 — 1000 to 
2 - 1200 for the TT power spectrum, and from 2 - 450 
to 2 — 800 for the TE pow er spectrum. We u se the 7- 
year V- and W-band maps ( Jar osik et al.|[20T^ to mea- 
sure the high-/ TT power spectrum in / = 33 — 1200. 
While we used only Q- and V-band maps to measure the 
high-/ TE and TB power spectra for the 5-year analysis 
(|Nolta et al.l[2009l) . we also include W-band maps in the 
7-year high-/ polarization analysis. 

With these data, we now detect the high-/ TE power 
spectrum at 21(T, compared to 13(7 for the 5-year high- 
/ TE data. This is a consequence of adding two more 
years of data and the W-band data. The TB data can be 
used to probe a rotation angle of the polarization plane, 
Aa, due to potential parity-violating effects or system- 
atic effects. With the 7-year high-/ TB data we find 
a limit Aa = -0.9° ± 1.4° (68% CL). For comparison, 
the limit from the 5-year hig h-/ TB power spectrum was 
Aa = -1.2°±2.2° (68% CL: IKomatsu et al.l[2009al ) . See 



Section 14751 for the 7- year limit on Aa from the full anal- 
ysis. 

Low-/ Temperature and Polarization. Except 
for using the 7-year maps and the new tempera- 
ture KQ85y7 mask, there is no change in the anal- 
ysis of the low-/ temperature and polarization data: 
we use the Inte rnal Linear Combination (ILC) map 
(|Gold et al.ll2010D to measure the low-/ TT power spec- 
trum in / = 2 — 32, and calculate the likelihood us- 
ing the Gibbs sampling and Black well-Rao (BR ) esti- 
mato r (IJewellet al.(l2004l: IWandeltl 12 003: Wandel t et al 



2OOI lO 'Dwver et al."2004( 'Erik sjn et al.,i2004 i2007a 

Chu et al. 2005; Larson et al. 2OO70. For the implemen- 
tation of the BR estimator in the 5-year analysis, see 



Section 2.1 of iDunklev et all (|2009D . We use Ka-, Q-, 
and V-band maps for the low-/ polarization analysis in 
1 = 2 — 23, and evaluate the likeliho od directly i n pix el 
space as described in Appendix D of Page et al.l ()2007f ). 

To get a feel for improvements in the low-/ polariza- 
tion data with two additional years of integration, we 
note that the 7-year limits on the optical depth, and 
the tensor-to-scalar ratio and rotation angle from the 
low-/ polariz ation data alone, a re t = 0.088 ± 0.015 
(68% CL; see lLarson et"llll20inh . r < 1.6 (95% CL; see 
Section HII]), and Aa = -3.8° ± 5.2° (68% CL; see Sec- 
tion 14. 5|) , respectively. T he corresponding 5-y ear limits 
were r = 0.087 ± 0.017 ()Dunklev et all [2001. r < 2.7 
(see Se ction l4l|) . and Aa = -7.5°±7.3° (jKomatsu et all 
l2009al) . respectively. 

In Table [HI we summarize the improvements from the 
5-year data mentioned above. 

3.2. External data sets 

The WMAP data are statistically powerful enough to 
constrain 6 parameters of a flat ACDM model with a 
tilted spectrum. However, to constrain deviations from 
this minimal model, other CMB data probing smaller an- 
gular scales and astrophysical data probing the expansion 
rates, distances, and growth of structure are useful. 

3.2.1. Small-scale CMB Data 

The best limits on the primordial helium abundance, 
Yp, are obtained when the WMAP data are combined 
with the power spectrum data from other CMB experi- 
ments probing smaller angular scales, / > 1000. 

We use the temperature power spectra from the Ar- 
cminute Cosmology Bolometer Array Receiver (ACBAR; 
.Reichardt et a l. 2009) and QUEST at DASI (QUaD) 
(jBrown et al.l[2Ci09( ) experiments. For the former, we use 
the temperature power spectrum binned in 16 band pow- 
ers in the multipole range 900 < / < 2000. For the lat- 
ter, we use the temperature power spectrum binned in 
13 band powers in 900 </ < 2000. 

We marginalize over the beam and calibration errors of 
each experiment: for ACBAR, the beam error is 2.6% on 
a 5 arcmin (FWHM) Gaussian beam and the calibration 
error is 2.05% in temperature. For QUaD, the beam error 
combines a 2.5% error on 5.2 and 3.8 arcmin (FWHM) 
Gaussian beams at 100 GHz and 150 GHz, respectively, 
with an additional term acco unting for the sidel obe un- 
certainty (see Appendix A of iBrown et al.l [20091 for de- 
tails). The calibration error is 3.4% in temperature. 

The ACBAR data are calibrated to the WMAP 5-year 
temperature data, and the QUaD data are calibrated 
to the BOOMERanG data (Masi et al. 2006) which are, 
in turn, calibrated to the WMAP 1-year temperature 
data. (The QUaD team takes into account the change 
in the calibration from the 1-year to the 5-year WMAP 
data.) The calibration errors quoted above are much 
greater than the calibration uncertain ty of the WMAP 5- 
year data (0.2% Hinsha w et al.l l200/). This is due to the 
noise of the ACBAR, QUaD, and BOOMERanG data. In 
other words, the above calibration errors are dominated 
by the statistical errors that are uncorrelated with the 
WMAP data. We thus treat the WMAP, ACBAR, and 
QUaD data as independent. 

Figure [7] shows the WMAP 7-year temperature power 
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spectrum (jLarson et al.ll2010l) as well as the temperature 
power spectra from ACBAR and QUaD. 

We do not use the other, previous small-scale CMB 
data, as their statistical errors are much larger than those 
of ACBAR and QUaD, and thus adding them would not 
improve the constraints on the cosmological parameters 
signifi cantly. The new p ower spect rum data from the 
SPT (iLueker et all [Mm and ACT (iFowler et al.llMol) 
collaborations were not yet available at the time of our 
analysis. 

3.2.2. Hubble Constant and Angular Diameter Distances 

There are two main astrophysical priors that we shall 
use in this paper: the Hubble constant and the angular 
diameter distances out to z — 0.2 and 0.35. 

• A Gaussian prior on the present-day Hubble con- 
stant, Hn ^ 74.2 ± 3.6 km Mpc"! (68% CL; 
iRiess et all 120091 ). The quoted error includes both 
statistical and systematic errors. This measure- 
ment of Hq is obtained from the magnitude-redshift 
relation of 240 low-z Type la supernovae at z < 0.1. 
The absolute magnitudes of supernovae are cali- 
brated using new observations from HST of 240 
Cepheid variables in six local Type la supernovae 
host galaxies and the maser galaxy NGC 4258. The 
systematic error is minimized by calibrating su- 
pernova luminosities directly using the geometric 
maser distance measurements. This is a significant 
improvement over the prior that we adopted for 
the 5-year analysis, Hq = 72 ± 8 km s~^ Mpc~^, 
which is from the Hub ble Key Project final results 
(jFreedman et al.|[200l . 

• Gaussian priors on the distance ratios, rs/Dv{z = 
0.2) = 0.1905 ± 0.0061 and rs/Dv{z = 0.35) = 
0.1097 ± 0.0036, measured from the Two-Degree 
Field Galaxy Redshift Survey {2dFGRS) and the 
Sloan Digital Sky Survey Data Release 7 {SDSS 
DR7) UPercival et al.|[2009l ). The inverse covariance 
matrix is given by equation (5) of iPercival et al.l 
(|2009h . These priors are improvements from those 
we adopted for the 5-year analysis, rs/Dv{z = 
0.2) = 0.1980 ± 0.0058 and rjDy iz = 0.35) = 
0.1094 ±0.0033 (IPercival et al.i[2007l) . 

The above measurements can be translated into 
a measurement of rs/Dv{z) at a single, "pivot" 
redshift: rJDy(z = 0.275) = 1390 ± 0.0037 
(IPercival et al.ll2009D . iKazin et all ()2010D used the 
two-point correlation function of SDS'S'-DR7 LRGs 
to measure TslDy(z) at z = 0.278. They found 
rs/Dv{z = 0.278) = 0.1394 ± 0.0049, which is an 
ex cellent agreement wit h the above measurement 
bv IPercival eTall ([2009f ) at a similar redshift. The 
excellent agreement between these two independent 
studies, which are based on very different methods, 
indicates that the systematic error in the derived 
values of rg/ Dv{z) may be much smaller than the 
statistical error. 

Here, is the comoving sound horizon size at the 
baryon drag epoch Zd, 



rs{zd) 



ViJo 



l/(l + 2d) 



da 



For Zd, we use t he fit ting formula proposed by 
lEisenstein fc Hul (I1998D. The effective distance 
measure, Dv{z) (|Eisenstein et al.ll2005D . IS given 
by 



{l + zYD\{z) 



cz 
H{z) 



1/3 



(16) 



where Da{z) is the proper (not comoving) angular 
diameter distance: 



Da{z) 



c fk Ho^/\^^J^ 



Ho 



{i + z)./\n^\ 



(17) 



where fk[x] = sinx, x, and sinhx for flk < 
[k = 1; positively curved), fife = (fc = 0; fiat), 
and Ofc > (fc = — 1; negatively curved), respec- 
tively. The Hubble expansion rate, which has con- 
tributions from baryons, cold dark matter, pho- 
tons, massless and massive neutrinos, curvature, 
and dark energy, is given by equation (P7)) in Sec- 
tion 13.31 



The cosmological parameters determined by combin- 
ing the WMAP data, BAO, and will be called 
"WMAP+BAO+iJo," and they constitute our best esti- 
mates of the cosmological parameters, unless noted oth- 
erwise. 

Note that, when redshift is much less than unity, the ef- 
fective distance approaches Dv{z) — cz/Hq. Therefore, 
the effect of different cosmological models on Dv(z) do 
not appear until one goes to higher redshifts. If redshift 
is very low, Dv{z) is simply measuring the Hubble con- 
stant. 

3.2.3. Power Spectrum of Luminous Red Galaxies 

A combination of the WMAP data and the power 
spectrum of Luminous Red Galaxies (LRGs) measured 
from the SDSS DR7 is a powerful probe of the total 
mass of neutrinos, ^ r riu, and the effecti ve number of 
neutrino species, N^s (jReid et al.l l2010allbh. We thus 
combine the LRG power spectrum (jReid et~ al. 2010a|) 
with the WMAP 7- year data and the Hubble constant 
(|Riess et al.l I2009D t o update the cons traints on ^ 
and iVeff reported in lReid et all (|2010aD . Note that BAO 
and the LRG power spectrum cannot be treated as inde- 
pendent data sets because a part of the measurement of 
BAO used LRGs as well. 

3.2.4. Luminosity Distances 

The luminosity distances out to high-z Type la su- 
pernovae have been the most powerful data for first dis- 
covering the existence of dark energy (jRiess et al.l 119981 : 
iPerlmutter et al.lll999f l and then constraining the prop- 
erties of dark e nergy, such as the e quation of state pa- 
rameter, w fsee lFrieman et al.ll2008l for a recent review) . 
With more than 400 Type la supernovae discovered, the 
constraints on the properties of dark energy inferred from 
Type la supernovae are now limited by systematic errors 
rather than by statistical errors. 

There is an indication that the constraints on dark 
energy parameters are different when different meth- 
ods are used to fit t he ligh t curves of Type la super 



(15) novae (|Hicken et al.l l2009bl: iKessler et alfjoogl) . 'We 
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Fig. 7. — The WMAP 7-year temperature power spectrum dLar son et al.l 120101 ), along with the temperature power spectra from the 
ACBAR ( Rcichardt et al. 2009) and QUaD ( Brown et al. 2009) experiments. We show the ACBAR and QUaD data only at I > 690, where 
the errors in the WMAP power spectrum are dominated by noise. We do not use the power spectrum at I > 2000 because of a potential 
contribution from the SZ effect and point sources. The solid line shows the best-fitting 6-parameter flat ACDM model to the WMAP data 
alone (see the 3rd column of Table [T] for the maximum likelihood parameters). 



also found that the parameters of the minimal 6- 
pa rameter ACDM mod el derived from two compilations 
of IKessler et"all (l2009l) are different: one compilation 
uses the light curve fitter called SALT-II (|Guv et al.ll2007[) 
while the o t her us es the light curve fitter called MLCS2K2 
(jJha et al.l 120071) . For example, Qa derived from 
WMAP-t-BAO-t-SALT-ll and WMAP-fBAO+MLCS2K2 
are different by nearly 2cr, despite being derived from the 
same data sets (but processed with two different light 
curve fitters). If we allow the dark energy equation of 
state parameter, w, to vary, we find that w derived from 
WMAP-f BAO+SALT-I I and WMAP+BAO+MLCS2K2 
are different by ^ 2.5(7. 

At the moment it is not obvious how to estimate sys- 
tematic errors and properly incorporate them in the like- 
lihood analysis, in order to reconcile different methods 
and data sets. 

In this paper, we shall use one compilation of 
the supernova data called the "Constitution" samples 
(|Hicken et al.ll2009bD . The reason for this choice over 
the o thers, such as the compilation by IKessler et al.l 
(|2009f) that includes the latest data from the SDSS-II 
supernova survey, is that the Constituti on samples are 
an extension of the "Union" samples (jKowalski et al.l 
[2OO8) that we used for th e 5-year analysis (see Sec- 
tion 2.3 of IKomatsu et all ^2009a^ . More specifically, 
the Constitution samples are the Union samples plus 
the latest samples of nearby Type la supernovae opti- 
cal photometry from the Center f or Astrophysi c s (CfA ) 
supernova group (CfA3 sample; IHicken et all l2009aD . 



Therefore, the parameter constraints from a combina- 
tion of the WM AP 7-year data, the latest BAG data 
described above (jPercival et al.|[2009[ ). and the Consti- 
tution supernova data may be directly compared to the 
" W MAP-HB AO-FSN " param eters given in Table 1 and 
2 of IKomatsu et all (j2009aD . This is a useful compari- 
son, as it shows how much the limits on parameters have 
improved by adding two more years of data. 

However, given the scatter of results among different 
compilations of the supernova data, we have decided to 
choose the ''WMAP+BAO+Hq' (see Section IX^ as 
our best data combination to constrain the cosmologi- 
cal parameters, except for dark energy parameters. For 
dark energy parameters, we compare the results from 
WMAP+BAO+Ho and WMAP-^BAO-hSN in Section^ 
Note that we always marginalize over the absolute mag- 
nitudes of Type la supernovae with a uniform prior. 

3.2.5. Time-delay Distance 

Can we measure angular diameter distances out to 
higher redshifts? Measurements of gravitational lensing 
time delays offer a way to determine absolute distance 
scales (Refsdal 1964). When a foreground galaxy lenses a 
background variable source (e.g., quasars) and produces 
multiple images of the source, changes of the source lu- 
minosity due to variability appear on multiple images at 
different times. 

The time delay at a given image position 6 for a given 
source position /3, t{9, f3), depends on the angular diam- 
eter distances as (see, e.g.. iSchneider et al... 2006k for a 
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1 + zi DiDs 
c Dis 



(18) 



where Di, Dg, and Dig are the angular diameter distances 
out to a lens galaxy, to a source galaxy, and between 
them, respectively, and (/)f is the so-called Fermat po- 
tential, which depends on the path length of light rays 
and gravitational potential of the lens galaxy. 

The biggest challenge for this method is to control sys- 
tematic errors in our knowledge of (jj-p, which requires a 
detailed modeling of mass distribution of the lens. One 
can, in principle, minimize this systematic error by find- 
ing a lens system where the mass distribution of lens is 
relatively simple. 

The lens system B1608-f656 is not a simple system, 
with two lens galaxies and dust extinction; however, it 
has one of the most precise time-delay measurements of 
quadruple lenses. The le ns redshift of this s ystem is rela- 
tively large, zi = 0.630 4 (iMvers et al.lll995D The source 
redshift is = 1.394 ()Fassnacht et al.lll996D . This sys- 
tem has been used to determine Hq to 10% accuracy 
([K popmans e t al. 2003). 

ISuvu et al.1 ([2009) have obtained more data from the 
deep HST Advanced Camera for Surveys (ACS) obser- 
vations of the asymmetric and spatially extended lensed 
images, and constrained the slope of mass distribution of 
the lens galaxies. They also obtained ancillary data (for 
stellar dynamics and lens environment studies) to con- 
trol the systematics, particularly the the so-called "mass- 
sheet degeneracy," which the strong lensing data alone 
cannot break. By doing so, they were able to reduce the 
error in Hp (including th e systematic error) by a factor 
of two (|Suvu et al.ll2010() . They find a constraint on the 
"time-delay distance," i?At, as 



DAt = (1 + zi) 



D\Ds 



5226 ± 206 Mpc, 



(19) 



where the number is found from a Gaussian fit to the 
likelihood of -DaO; however, the actual shape of the 
likelihood is slightly non-Gaussian. We thus use: 

• Likelihood of D^t out to t he le ns system 
B1608+656 given bv lSuvu et all (|2010l ). 

, , expf-(ln(x-A)-/i)V(2a2)l ^ ^ 



2tt{x — \)a 

where x = D^tKl Mpc), A = 4000, ^ = 7.053, 
and (T = 0.2282. This likelihood includes system- 
atic errors due to the mass-sheet degeneracy, which 
dominates the to tal error budget (see Section 6 of 
ISuvu et al.l[20T0l for more details). Note that this 
is the only lens system for which D/^t (rather than 
Hq) has been constrained^ 

^"^ S. H. Suyu 2009, private communication. 

As the time-delay distance, D/^t, is the angular diam- 
eter distance to the lens, Di, multiplied by the distance ra- 
tio, Da/Dig, the sensitivity of -Da* to cosmological parameters 
is so mewhat limited compared to that of Di flFukugita ct al. 
119901) . On the other hand, if the density profile of the lens 
galaxy is approximately given by p oc the observed Ein- 

stein radius and velocity dispersion of the lens galaxy can be 
used to infer the same distance ratio, Ds/Di^, and thus one 



3.3. Treating Massive Neutrinos in H{a) Exactly 

When we evaluate the likelihood of external astrophys- 
ical data sets, wc often need to compute the Hubble ex- 
pansion rate, H{a). While we treated the effect of mas- 
sive neutrinos on H{a) approximately for the 5-year anal- 
ysis of the external data sets presented in lKomatsu et all 
(20093) J "we treat it exactly for the 7- year analysis, as de- 
scribed below. 

The total energy density of massive neutrino species, 
Pi/, is given by (in natural units) 



p^{a) 2 



/ (t^3,p/T.(a)+l E\/p'+<>' (21) 



where mn^i is the mass of each neutrino species. Using 
the comoving momentum, q = pa, and the present-day 
neutrino temperature, T^q = (4/ll)i/^rcmb = 1-945 K, 
we write 

i 

Throughout this paper, we shall assume that all massive 
neutrino species have the equal mass m,y, i.e., m^^i = mi, 
for alUEB 

When neutrinos are relativistic, one may relate pi, to 
the photon energy density, p-y, as 



Pu{a) 



7/4 
IT 



4/3 



N,sp^{a) ~ 0.2271iVcffp^(a), (23) 



where A'cff is the effective number of neutrino species. 
Note that A'cff — 3.04 for the standard neutrino 
speciesEB This motivates our writing equation ([22l) as 



p,(a) = 0.2271A^cffP7(a)/("^.a/T,o), (24) 



where 



^, . 120 p , x^^/x^+y^ 



(25) 



The limits of this function are f{y) — > 1 for y — > 0, 
and f{y) i^^y for y ^ oo, where C(3) ~ 1.202 is 
the Riemann zeta function. We find that f{y) can be 



can u se this property to constrai n cosmological parameters as 
well HFi itamase fc Yoshida 2001; Yamamoto & Futamasc 200 1|; 
lYamamo to ct al. 2001; Ohvama ct al. 2002; Dobkc ct al. 2009), up 
to uncertainties in the density profile tChiba fc Takahashi. 2002). 
By combining measurements of the time-delay, Einstein ring, and 
velocity dispersion, one can in principle measure Di directly, 
thereby turning strong gravitational lens systems into standard 
rulers i Paraficz fc Hiorth.200 9). While the accuracy of the current 
data for B1608+656 does not permit us to determine precisely 
yet (S. H. Suyu and P. J. Marshall 2009, private communication), 
there seems to be exciting future prospects for this metho d. Fu- 
ture p r ospects of the time-delay method are also discussed in lOguril 
l|2007l ): ICoTS Moustakas (2009). 

While the current cosmological data are not yet sensitive 
to the mass of individual neutrino species, that is, the mass 
hierarchy, this situation may change in the futur e, with high 
z galaxy redshift surveys or weak lensing surveys (Takada et al 
[2006; Siosar 2006; Hanncstad & Wong 2007; Kitching et al. 200" 
Rbdalla fc Rawlings»2007 ). 

A recent estimate gives A^off = 3.046 IjMangano et al.ll2005l ) 
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approximated by the following fitting formula^ 
/(y)«[l + (^yn^/^ 



(26) 



where A 



180C(3) 



0.3173 and p = 1.83. This fit- 



ting formula is constructed such that it reproduces the 
asymptotic limits in y — >■ and ?/ — >■ cxd exactly. This 
fitting formula underestimates f{y) by 0.1% at y ~ 2.5, 
and overestimates by 0.35% at y ~ 10. The errors are 
smaller than these values at other y's. 
Using this result, we write the Hubble expansion rate 

as 

H{a) ^Ho\ \ + % [1 + 0.2271iVcff /(m,a/T,o)] 



1/2 



(27) 



2.725 K. Us- 



q2 g3(l+iOeff(a)) 

where il^ = 2.469 x IQ-^h-'^ for T„„b 
ing the massive neutrino density parameter, fl^h^ = 
X]™i^/(94 eV), for the standard 3 neutrino species, we 
find 

m^a 187 f n^h^\ 

(28j 



1 + z V 10 



One can check that {n^/a'^)0.2271Ncsf{m^a/T^o) 
rjjy/a'^ for a oo. One may compare equation ((27)) . 
which is exact (if we compute f (y) exactly), to equa- 
tion (7) of lKomatsu et all ()2009al i. which is approximate. 

Throughout this paper, we shall use to denote the 
dark energy density parameter at present: VIa = Qdeiz = 
0). The function Woff(a) in equation ^17} is the effec- 
tive equation of state of dark energy given by Weff (a) = 

Wa /o""'^^^'^' ""^("^Oj ^-^d w{a) is the usual dark energy 
equation of state, i.e., the dark energy pressure divided 
by the dark energy density: w{a) = Pde{o) I Pde{a)- For 
vacuum energy (cosmological constant), w does not de- 
pend on time, and w — —1. 

4. COSMOLOGICAL PARAMETERS UPDATE EXCEPT 
FOR DARK ENERGY 

4.1. Primordial Spectral Index and Gravitational Waves 

The 7-year WMAP data combined with BAO and Hq 
exclude the scale-invariant spectrum by 99.5% CL, if we 
ignore tensor modes (gravitational waves). 

For a power-law power spectrum of primordial curva- 
ture perturbations TZk, i.e.. 



kH\nk\ 



^A^(M(^ 



(29) 



27r2 

where fco = 0.002 Mpc"\ we find 

Us = 0.968 ±0.012 (68% CL). 

For comparison, the WMAP da. t a-only limit is 
Us = 0.967 ± 0.014 ([Larson et al.l [20Tol ). and the 
WMAP plus the small -scale CMB e xperiments ACBAR 
()Reichardt et al.l [20091) and QUaD ([Brown et al.l [20091) 

Also see Section 5 of lWrightj 1 1200^) . where pi/ is normalized 
by the density in the non-relativistic limit. Here, p,j is normalized 
by the density in the relativistic limit. Both results agree with the 
same precision. 
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Tensor-to-scalar Ratio (r) 

Fig. 8. — Limits on the tensor-to-scalar ratio, r, from the po- 
larization data (BB, EE and TE) alone. All the other cosmo- 
logical parameters, including the optical depth, a re fixed at the 
5-year best- fit ACDM model ijDunklev et al.1 120091 ). The vertical 
axis shows — 2 ln(L/Lniax), where L is the likelihood and Lmax 
is the maximum value. This quantity may be interpreted as the 
standard i as the likelihood is approximately a Gaussian near the 
maximum; thus, — 2 ln(L/Ljnax) = 4 corresponds to the 95.4% CL 
limit. The solid, dashed and dot-dashed lines show the likelihood 
as a function of r from the BB-only, BB-fEE, and BB-I-EE-I-TE 
data. (Left) The 7-ycar polarization data. We find r < 2.1, 1.6, 
and 0.93 (95.4% CL) from the BB-only, BB+EE, and BB-f EE-(-TE 
data, respectively. (Right) The 5-year polarization data. We find 
r < 4.7, 2.7, and 1.6 (95.4% CL) from the BB-only, BB-(-EE, and 
BB-I-EE-I-TE data, respectively. 



is Us = 0.966^ 013. explained in Section 3.1.2 of 
iKomatsu et al.l ([2009al ) , the small-scale CMB data do not 
reduce the error bar in Us very much because of relatively 
large statistical errors, beam errors, and calibration er- 
rors. 

How about tensor modes? While the B-mode 
olarization is a smoking-gun for tensor mode s 
Seliak & Zaldarriaga 1997^ Ka mionkowski et al.l 1199/) . 
the WMAP data mainly constrain the amplitude of ten- 
sor modes by th e low-? temperature po wer spectrum (see 
Section 3.2.3 of lKomatsu et al.|l2009aD . Nevertheless, it 
is still useful to see how much constraint one can obtain 
from the 7-year polarization data. 

We first fix the cosmological parameters at the 5-year 
WMAP best-fit values of a power-law ACDM model. We 
then calculate the tensor mode contributions to the B- 
mode, E-mode, and TE power spectra as a function of 
one parameter: the amplitude, in the form of the tensor- 
to-scalar ratio, r, defined as 



A^(fco)' 



(30) 



where A^j(fc) is the power spectrum of tensor metric per- 
turbations, hk, given by 



4P(|fefc| 
27r2 



Ai(..) I i 



(31) 



In Figure [8l we show the limits on r from the B-mode 
power spectrum only (r < 2.1, 95% CL), from the B- 
and E-mode power spectra combined (r < 1.6), and from 
the B-mode, E-mode, and TE power spectra combined 
(r < 0.93). These limits are significantly better than 
those from the 5-year data (r < 4.7, 2.7, and 1.6, re- 
spectively) , because of the smaller noise and shifts in the 
best-fitting values. For comparison, the B-mode power 
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TABLE 7 

Primordial tilt tis, running index dris/dlnk. and tensor-to-scalar ratio r 



Section 


Model 


Parameter'^ 


7-year WMAP^ 


WMAP+ACBAR-t-QUaD'^ WMAP+BAO+Ho 


Section mH 


Power-law^ 


ris 


0.967 ± 0.014 


0.966_o 


0.968 ± 0.012 


Section 14^ 


Running 


dris/dln k 


-0.051 

-0.034 ± 0.026 


-0.046 

-0 041+° °^^ 


1.008 ± 0.042' 
-0.022 ± 0.020 


Section rni 


Tensor 


ris 
r 


n qoo + O.liaU 
u. JO^'-O.OIO 

< 0.36 (95% CL) 


979+" "^" 
< 0.33 (95% CL) 


0.973 ± 0.014 
< 0.24 (95% CL) 


Sectionl4.2l 


Running 
-t-Tensor 


ris 
r 

dus /dink 


1.076 ± 0.065 
< 0.49 (95% CL) 
-0.048 ± 0.029 


N/A 


1.070 ± 0.060 
< 0.49 (95% CL) 
-0.042 ± 0.024 



^ Defined at ko ^ 0.002 Mpc"^. 

^[Larson ct al. ( 2 0101) . 

ACBAR iRcichardt et al.|[200g|) : QUaD JBrown et aI1l200gD . 

The parameters in this row are based on RECFAST version 1.5 (see Appendix^J, while the parameters 
in all the other rows are based on RECFAST version 1.4.2. 

At the pivot point for WMAP only, where and dris /dink are uncorrelated, (/Cpivot ) — 
0.964 lb 0.014. The "pivot wavenumber" may be defined in two ways: (i) fcpivot — 0.0805 Mpc""^ 
from ns(/Cpivot) — ns{ko) + ^ (dris /d In fc) ln(fcpivot /^o) , or (ii) fcpivot — 0.0125 Mpc"-'- from 
d\nA^/dlnk\ ^ Usiko) - 1 ~\- {dus/dlnk) ln(A;pivot /^o) • 



At the pivot point for WMAP+BAO+i/o , where rig and dUs /d\n k are uncorrelated, 
0.964 ± 0.013. The "pivot wavenumber" may be defined in two ways: (i) fcpivot — 



from ns(fcpivot) — ^^s(fco) + | (dn^ /d In fc) ln(A;pivot /A;o) , or (ii) 
dlnA^/(iln/c|^_^ — ns(fco) — 1 + {dris/d Ink) ln(fcpivot /ko). 



0.0155 



Tig (/Cpivot ) 

0.106 Mpc" 
Mpc"^ ft-o 



spectrum fro m the BICEP 2 -year data gives r < 0.73 
(95% CL; Chi ang et al.l[20lol) . 

If we add the temperature power spectrum, but still 
fix all the other cosmological parameters including Ug, 
then we find r < 0.15 (95% CL) from both 5-year and 
7-year data; however, due to a strong correlation be- 
tween Us and r, this would be an underestimate of the 
error. For a 7-parameter model (a flat ACDM model 
with a tilted spectrum, tensor modes, and rit — — r/8), 
we find r < 36 (95 % CL) from the WMAP data alone 
(|Larson et al.ll20m) . r < 0.33 (95% CL) from WMAP 
plus ACBAR and QUaD, 

r < 0.24 (95% CL) 

from WMAP+BAO+Ho, and r < 0.20 (95% CL) from 
WMAP+BAO+SN, where "SN" i s the C onstitution 
samples compiled by Hic ken et al.l ()2009b[ ) (see Sec- 
tion [3X4]). 

We give a summary of these numbers in Table [T) 

4.2. Running Spectral Index 

Let us relax the assumption that the power spectrum is 
a p ure power-law, and add a "running index," dus/dhik 
as (|Kosowskv fc TurnedlT995h 



-1.2, -2.6, and 
-ACBAR-fQUaD, 
See Table [7] for 



fT-siko) — !-^^ \n(k/ kQ)dns /d\nk 



(32) 



Ignoring tensor modes again, we find 

dus/dhik = -0.022 ±0.020 (68% CL), 

from WMAP+BAO+Ho- For comparison, the WMAP 
data-only limit is dus/dlnk = -0.034 ± 0.026 
()Larson et al.|[20Tol ). and the WMAP-H ACBAR-^QUaD 
limit is dns/d\nk = -OMltoml- 

None of these data combinations require 
dug/ dhik: improvements in a goodness-of-fit rel- 
ative to a power-law model (equation (p9)) ) are 



^X"" — ^ ll^(.^running/.^powcr— law) — 

-0.72 for the WMAP-only, WMAP^ 
and WMAP-f-BAO-l-i/o, respectively, 
the case where both r and dus/dhi k are allowed to vary. 

A simple power-law primordial power spectrum with- 
out tensor modes continues to be an excellent fit to the 
data. While we have not done a non-parametric study 
of the shape of the power spectrum, recent studies after 
the 5-year data release continue to show that there is no 
convincing deviation f rom a simple power-law spectrum 
(IPeiris fc Verdd I20T0I: llchiki et al.l [20T0I: IHamann et all 

miW . 

4.3. Spatial Curvature 

While the WMAP data alone cannot constrain the spa- 
tial curvature parameter of the observable universe, fife, 
very well, combining the WMAP data with other dis- 
tance indicators s uch as Hp, BAQ^ o r supernovae can 
constrain Qk (e.g.. iSpergel et ai]|2007[ ). 

Assuming a ACDM model {w = —1), we find 

-0.0133 < f7fe < 0.0084 (95% CL), 

from WMAP+BAO+Ho^ However, the limit weakens 
significantly if dark energy is allowed to be dynamical, 
u; ^ -1, as this data combination, WMAP-|-BAO+i7o, 
cannot constrain w very well. We need additional infor- 
mation from Type la supernovae to constrain w and fl k 
simultaneously (see Section 5.3 of lKomatsu et al.ll2009a[ ) . 
We shall explore this possibility in Section [5l 

4.4. Non-Adiabaticity: Implications for Axions 

Non-adiabatic fluctuations are a powerful probe of 
the origin of matter and the physics of infla tion. Fol- 
lowing Section 3.6 of iKomatsu et al.l (|2009al) . we focus 
on two physically motiva ted models for non-ad i abatic 
fluctuations: axion-type (jSeckel fc Turneij Il985t iLindd 



The 68% CL limit is 0.^ 



-0 0023+° °°^'' 
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[19851 [l99l iTurner fc Wilc zekl H oH and curvaton- 
tvpe (iLinde fc MukhanoTfl 997: L ^h fc Wand^ [20031: 
IMoroi fc Takahashill200lL " l2002t iBartolo fc Liddldl200l) . 

For both cases, we consider non-adiabatic fluctuations 
between photons and cold dark matter: 



5=^ 



(33) 



and report limits on the ratio of the power spectrum 
of S and that o f the curvature perturbation TZ (e.g., 
iBean et al.|[2006[ ): 



l-a{ko) PnikoV 



(34) 



where fco — 0.002 Mpc~ . We denote the limits on axion- 
type and curvaton-type by a-nd a-_i, respectivelv[^ 

We find no evidence for non-adiabatic fluctuations. 
The WMAP data-only limits are ao < 0. 13 (95% CL) 
and Q-1 < 0.011 (95% CL) (95% CL; ILarson et al.l 
[20Tol ). With WMAP-hBAO-hi?o, we find 



ao < 0.077 (95% CL) and a_i < 0.0047 (95% CL), 



< 



while with WMAP-hBAO-hSN, we find ao 
0.064 (95% CL) and a_i < 0.0037 (95% CL). 

The limit on ao has an important implication for ax- 
ion dark matter. In particular, a limit on ao is related 
to a limit on the tensor-to-scalar ratio, r (Kain 2006. 



Beltran et al.l 120071: ISikivid 120081: [Kawasaki fc Sekiguchi 
20081). The explicit formula is given by equation (48) of 



Komatsu et"all (|2009al) aS 



4.7 X 10-12 fQ^H 



glO/7 



7 



12/7 



2/7 



ao 



1 - ao 



(36) 



where Vta < is the axion density parameter, 9a is 
the phase of the Peccei-Quinn field within our observ- 
able universe, and 7 < 1 is a "dilution factor" represent- 
ing the amount by which the axion density parameter, 
Vlah? , would have been diluted due to a potential late- 
time entropy production by, e.g., decay of some (unspec- 
ified) heavy particles, between 200 MeV and the epoch 
of nucleosynthesis, 1 MeV. 

Where does this formula come from? Within the con- 
text of the "misalignment" scenario of axion dark mat- 
tej^. there are two observables one can use to place lim- 
its on the axion properties: the dark matter density and 

The limits on a can also be converted into the numbers 
showing "how much the adiabatic relation (iS = 0) can be violated," 
(5adi, which can be calculated from 



^[<5pc/pc-H3<5p^/(4p^)] 



1, 



(35) 



for a < 1 IIKomatsu et al.ll2009a l ) . 

This formula assumes that the axion field began to oscillate 
before the QCD phase transition. The formula in the other limit 
will be given later. We shall assume that the energy density of the 
universe was dominated by radiation when the axion field began 
to os cillate; however, this may not always be true (Kawasaki ct al. 
[199^ : [Kawasaki fc Tak ahashi 200^) when there was a significant 
amount of entropy production after the QCD phase transition, i.e., 
7 < 1- 

We make the following assumptions: the Peccei-Quinn sym- 
metry was broken during inflation but before the fluctuations we 
observe today left the horizon, and was not restored before or after 
the end of inflation (reheating). That the Peccei-Quinn symme- 



ao. They are given by fe.g.. lKawasaki fc Sekiguchill2008l 

and references therein) 



l-ao(fc) aiei{fjM,,,f 



fa 



IQi" GeV 



7/6 



(37) 



(38) 



where fa is the axion decay constant, and e — — / H?^^ 
is the so-called slow-roll parameter (where Hi^f is the 
Hubble expansion rate during inflation). For single- field 
infiation models, e is related to r as r = 16e. By eliminat- 
ing the axion decay constant, one obtains equation (1551) . 

In deriving the above formula for ilah'^ (equation ([38|) ). 
we have assumed that the axion fleld began to oscillate 
before the QCD phase transition(3 This is true when 
/, < O(10-2)Mp,; however, when /, > O{10-^)Mpi, 
the axions are so light that the axion field would not 
start oscillating after the QCD phase transition(3 In 
this limit, the formula for flah^ is given by 



^ah^ = 1.6 X 10^61^ 



fa 



1017 GcV 



3/2 



(39) 



By eliminating fa from equation (j37p and p9p . we obtain 
another formula for r: 



4.0 X 10 
^273" 



-10 



7 



4/3 



fla 



2/3 



ao 



1 - ao 



(40) 



Equation ([5^ and combined with our limits on 

rich^ and ao, implies that the axion dark matter scenario 
in which axions account for most of the observed amount 
of dark matter, fla ^Ic, must satisfy 

7 6 X 10^1^ 

r< .0/7 1,/, for/.<O(10-2)Mp,, (41) 
15x1 0^12 

r< . for > O(10-2)AV. (42) 



^2/3^/3 



try was not restored before reheating requires the expansion rate 
during infiation not to exceed the axion decay constant, < fa 

HLvth fc Stewar3ll992l 'l. That the Peccei-Quinn symmetry was not 
restored after reheating requires the reheating temperature after 
inflation not to exceed fa- 

^® Specifically, the temperature at which the axion field began 
to oscillate, Ti, can be calculated from the condition 3H{Ti) = 
ma{Tx), where ma{T) ^ 0.1mao(0.2 GeV /T^ is the mass of 
axions before the QCD phase transition, T > 0.2 GeV, and 
"niaO = 13 MeV(l GeV/ fa) is the mass of axions at the zero 
temperature. Here, we have used the pion decay constant of 
= 184 MeV to calculate niao, following equation (3.4.16) of 
I Weinberg II2008I ). The Hubble expansion rate during radiation era 
isgivenby M2ji/2(T) = (7r2/90)g,T4, where Mpj = 2.4xl0l** GeV 
is the reduced Planck mass and g, is the number of relativistic de- 
grees of freedom. Before the QCD phase transition, g, = 61.75. 
After the QCD phase transition but before the electron-positron 
annihilation, g» = 10.75. 

27 This dividing point, fa = O(10-^)Mpi, can be found from 
the condition Ti = 0.2 GeV and 3H(Ti) = ma(Ti). See 
Hcrtzbcrg c t~al] 112003) for more accurate numerical estimate. Note 
that Hertz berget'^ini200SI ) used F-^ = 93 MeV for the pion decay 
constant when calculating the axion mass at the zero temperature. 
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Alternatively, one can express this constraint as 



6la7^/^<3.3 X 10" 



10- 



-2\ 7/10 



for fa < O(10-2)Mp,, 



9aY <1.8 X 10 



-15 



10 



-2\ 3/2 



for fa > O(10-2)Mp 



Therefore, a future detection of tensor modes at the level 
of r = 10~^ would i mply a fine-tu ning of 0a or 7 or both 
of these parameters (|Komatsu et al. 2009j3). If such fine- 
tunings are not permitted, axions cannot account for the 
observed abundance of dark matter (in the misalignment 
scenario that we have considered here). 

Depending on one's interest, one may wish to elimi- 
nate the phase, leaving the a xion decay constant in the 
formula (see equation (B7) of lKomatsu et al1l2009aD : 



r = (1.6x10^^^) 



f \ 5/6 



6/5 



^laj GeVy 1 - ao 

(43) 

for / < 0{l()-'^)Mpi. This formula gives 

fa > 1.8 X 10^6 GeV 7«/5 (^) 

which is inconsistent with the condition fa < 
O{10~^)Mpi (unless r is extremely small). The formula 
that is vahd for / > O{10-'^)Mpi is 



(44) 



r = (2.2x10-®) 



7 



fa \ "0 



naj VlO^^ GeVy 1 -ao' 

(45) 



which gives 

fa > 3.2 X 10^^ GeY ^'[j^y . (46) 
Requiring fa < Mpi = 2.4 x 10^® GeV, we obtain 

(47) 



r < 



.7 X 10 



-10 



7 



Thus, a future detection of tensor modes at the level of 
r = 10^^ implies a significant amount of entropy produc- 
tion, 7 1, or a super-Planckian axion decay constant , 
/„ » M„;, or both. Also see [H ertzbc rg et al.l (|2008f ): 
[Maci3 ([2009l ): lMack fc Steinhardtl (|2009l) for similar stud- 
ies. 

For the implicati ons of Q-i for curvaton dark matter, 
see Section 3.6.4 of lKomatsu et al.l (|2009aD . 

4.5. Parity Violation 

While the TB and EE correlations vanish in a 
parity-conserving universe, they may not vanish when 
global parity symmetry is broken on cosmological scales 
(|Lue et al.l fl999l: iCarrollI [1998). In pixel space, they 
would show up as a non- vanishing (Ur). As we showed 
already in Section the WMAP 7-year {Ur) data are 
consistent with noise. What can we learn from this? 

It is now a routine work of CMB experiments to deliver 
the TB and EB data, and constrain a rotation angle of 
the polarization plane due to a parity- violating effect (or 
a rotation due to some systematic error). Specifically, a 



rotation of the polarization plane by an angle Aa gives 
the following 5 transformations: 



TE,obs ^TE /OA ^ 

; =6; cos(2Aa), 

TB,obs^^TEgij^(2Aa), 

™'°'^^ = C™cos2(2Aa), 

BB,obs^^EEgj^2(2A«), 



EB.obs 



2^' 



EE 



sin(4Aa), 



(48) 
(49) 
(50) 
(51) 

(52) 



where C/'s on the right hand side are the primordial 
power spectra in the absence of rotation, while C°^^'s 
on the left hand side are what we would observe in the 
presence of rotation. 

Note that these equations are not exact but valid only 
when the primordial i?-mode polarization is negligible 
compared to the i?-mode polarization, i.e., <^ Cf^. 
For the fu h expressio n incl uding C^^, see iLue et ahl 
(1999) and lFeng et all (|2005[ ). 

Roughly speaking, when the polarization data are still 
dominated by noise rather than by the cosmic signal, the 
uncertainty in Aa is given by a half of the inverse of the 
signal-to- noise ratio of TE or EE, i.e.. 



Err[Aa™l 



1 



Err[A( 



,EBi 



2(^/7V)TE^ 
1 

2(^/7V)EE- 



(Note that we use the full likelihood code to find the best- 
fitting values and error bars. These equations should 
only be used to provide an intuitive feel of how the er- 
rors scale with signal-to-noise.) As we mentioned in the 
last paragraph of Section 12. 4[ with the 7- year polariza- 
tion data we detect the TE power spectrum at 21cr from 
; = 24 to 800. We thus expect Err[Aa'^^] ~ 1/42 ~ 
0.024 radian ~ 1.4°, w hich is signifi cantly better than 
the 5-year value, 2.2° (iKo matsu eFa l. 2009a). On the 
other hand, we detect the EE power spectrum at ^ > 24 
only at a few a level, and thus Err[Aa^^] > Err[Aa'^^], 
implying that we may ignore the high-/ EB data. 

The magnitude of polarization rotation angle, Aa, de- 
pends on the path length over which photons experi- 
enced a parity-violating interaction. As pointed out by 
ILiu et al. (2006), this leads to the polarization angle that 
depends on I. We can divide this /-dependence in two 
regimes: (i) I < 20: the polarization sig nal was generated 
during reionization ()Zaldarriagalll997r ). We are sensitive 
only to the polarization rotation between the reionization 
epoch and present epoch, (ii) I > 20: the polarization 
signal was generated at the decoupling epoch. We are 
sensitive to the polarization rotation between the decou- 
pling epoch and present epoch; thus, we have the largest 
path length in this case. 

Using the high-/ TB data from / = 24 to 800, we 
find Aa = —0.9° ± 1.4°, which is a significant improve- 
ment over the 5-year h igh-/ result, Aa = — 1.2° ± 2.2° 
(jKomatsu et al.ll2009aD . 

Let us turn our attention to lower multipoles, / < 23. 
Here, with the 7-year polarization data, the EE power 
spectrum is detected at 5.1tT, whereas the TE power 
spectrum is only marginally seen (1.9a). (The overall 
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significance level of detection of the E-model polariza- 
tion at I < 23, including EE and TE, is 5.5a.) We there- 
fore use both the TB and EB data at / < 23. We find 
Aa = —3.8° ± 5.2°, which is also a good improvement 
over the 5-year low-Z value, Aa ~ —7.5° ± 7.3°. 

Combining the low-^ TB/EB and high-Z TB data, we 
find Aa = —1.1° ± 1.4° (the 5-year combined limit was 
Aa — —1.7° ±2.1°), where the quoted error is purely 
statistical; however, the WMAP instrument can mea- 
sure the polarization angle to within ±1.5° of the design 
orientation ([Page et al. 2003, 2007). We thus add 1.5° 
as an estimate of a potential systematic error. Our final 
7-year limit is 

Aa = -1.1° ± 1.4° (stat.) ± 1.5° (syst.) (68% CL), 

or -5.0° < Aa < 2.8° (95% CL), for which we have 
added the statistical and systematic errors in quadrature 
(which may be an under-estimate of the total error). The 
statistical error and systematic error are now compara- 
ble. 

Several research groups have obtained limits 
on Aa from vario us data s ets (iFeng et al.l 2006 
Kosteleckv fc MewesI [20071 : iCabella et all 12007 



the WMAP+BAO+Ho limit is 



< 0.58 eV (95% CL) (for w = -I) 



2009; 



Xia et all I2008t IXia etaLT I2008bt IWu'erall 
Gubitosi et a l.l I2009D . ^c entlv. the BOOMERanG 
collaboration (jPagano et al.l I2OO90 revisited a limit 
on Aa from their 2003 flight (B2K), taking into 
account the effect of systematic errors rotating the 
polarization angle by —0.9° ± 0.7°. By removing 
this, they find Aa = -4.3° ± 4.1° (68% CL). The 
QUaD collaboration used their final data set to find 
Aa = 0.64° ± 0. 5 0° (stat.) ± 0.50 ° (syst.) (68% CL; 
IBrown et al.|[2009h. IXia et al l ^QUh used the BICEP 2- 
year data (|Chiang et al.ll2oTo[ i to find Aa = -2.6° ± 1.0° 
(68% CL statistical); however, a systematic error of 
±0.7° needs to be added to this error budget (see 
"Polarization orientation uncertainty" in Table 3 of 
[Takahashi et al. 2010). Therefore, basically the sys- 
tematic errors in recent measurements of Aa from 
WMAP 7-year, QUaD final, and BICEP 2-year data are 
comparable to the statistical errors. 

Adding the statistical and systematic errors in quadra- 
ture and averaging over WMAP, QUaD and BICEP with 
the inverse variance weighting, we find Aa — —0.25° ± 
0.58° (68% CL), or -1.41° < Aa < 0.91° (95% CL). 
We therefore conclude that the microwave background 
data are comfortably cons istent with a parity-co n servin g 
universe. See , e.g ., iKosteleckv fc MewesI (|2008D : 
lArvanitaki et al.l ()2009l ) and references therein for impli- 
cations of this result for potential violations of Lorentz 
invariance and CPT symmetry. 

4.6. Neutrino Mass 



Following Section 6.1 of iKomatsu et al.l ()2009aj) (also 
see references therein), we constrain the total mass of 
neutrinos, ^ rrii, = 94 eV{n^h'^), mainly from the 7-year 
WMAP data combined with the distance information. A 
new component in the analysis is the exact treatment of 
massive neutrinos when calculating the likelihoo d of the 
BAO data, as described in Section 13.31 (also see iWrightl 
l2006l) . 

For a flat ACDM model, i.e., w = -1 and flk = 0, the 
WMAP-only limit is ^ < 1.3 eV (95% CL), while 



The latter is the best upper limit on ^ rrii, without in- 
formation on the growth of structure, which is achieved 
by a better measurement of the early Integrated Sachs- 
Wolfe (ISW) effect through the third aco ustic peak of 
the 7 -year temperature power spectrum (jLarson et al.l 
20101). as w e ll as by a better determination of Hq from 
Riess et all (I2Q09D. F or explanations of this effect, see 



Ichikawa et al.l " (l2Q0l or Section 6.1.3 of IKomatsu et al.l 
t2009ah. 

iSekiguchi et al.l (|2010l ) combined the 5-year version of 
WMAP±BAO+7?o with the small-scale CMB data to 
find X;™!^ < 0-66 eV (95% CL). Therefore, the im- 
provement from this value to our 7-year limit, ^ < 
0.58 eV, indeed comes from a better determination of 
the amplitude of the third acoustic peak in the 7-year 
temperature data. 

The limit improves when information on the growth of 
structure is added. For exam ple, with WMAP+ Hp and 
the power spectrum of LRGs ()Reid et al]|2010al see Sec- 
tion 13X31) combined, we find J2m^ < 0.44 eV (95% CL) 
for w — —1. 

The WMAP+BAO+iJo limit on the neutrino mass 
weakens significantly to < 1-3 eV (95% CL) for 

w —1 because we do not use information of Type la 
supernovae here to constrain w. This is driven by w 
being too neg ative: there is an anti-correlation between 
w and X "^i' (jHannestadl 120051 ) . The best-fitting value 
of w in this case is w = -1.44 ± 0.27 (68% CL)E!I For 
WMAP+LRG+Ho, we find J^m-^ < 0-71 eV (95% CL) 
for w —1. When the Constitution supernova data are 
included ( WMAP+BAO+SN), we find Y^m^ < 0.710 
and 0.91 eV (95% CL) for w = —1 and w 7^ —1, respec- 
tively. 

Recent studies after the 5-year data release com- 
bined the WMAP 5-year data with information on the 
growth of stru c ture t o find various improved limits. 
iVikhlinin et al.l (|2009bl ) added the abundance of X-ray- 
selected clusters of galaxies, which were found in the 
ROSAT All Sky Survey and followed up by the Chan- 
d ra X-ray Obse rvatory (their cluster catalog is described 
in lVikhliniiTera l. 2009a ), to the WMAP 5 -year data, the 
BAO measurement fromlEisenstein et al.' (2005[), and the 
Type la supernova data from Davis ct al. (20 03), to find 
Vm. < 0.33 eV (95% CL) for w ^ -1. iMantz et al.l 
(l2010b[ ) added a different cluster catalog, also derived 
from the ROSAT All Sky Survey and followed up by 
the Chandra X-ray Observatory (their cluster catalog is 
described in IMantz et al.ll2010aD , and the measurement 
of th e gas mass fraction of relaxed clusters (I Allen et al.l 
I2008| ) to t he WMAP 5-vear d ata, the BAO measure- 
ment from iPercival et al.r (|2007l. and the "Union " Type 
la supernova samples frorn lKowalski et al.l ()2008[ ) (all of 

That the neutrino mass and w are anti-correlated impHes 
that the neutrino mass Hmit would improve if we impose a prior 
on w) as ui > — 1. 

29 The 7-year WMAP+BAO+SN limit for u) = -1 is slightly 
weaker than the 5-year WMAP+BAO+SN limit, 0.67 eV. The 5- 
year limit was derived using an approximate treatment of the effect 
of massive neutrinos on r s / Dy {z) . The 7-year limit we quote here , 
which uses the exact treatment of massive neutrinos f Section 13. 3p . 
is more reliable. 
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which constitute the 5 -year " WMAP+BAO-f SN" set in 
iKomatsu et"alll2009al) . to find J2 '^^ < 0-33 and 0.43 eV 
( 95% CL) for w = —\ and w ^ — I, respectively. 

iReid et all (|2010b[ ) added a prior on the ampli- 
tude of matter density flu ctuations, cr8(^7T t/0-25)° '*^ = 
0.832 ± 0.033 (68% CL; ,Rozo et al.l[20Tol) . which was 
derived from the abundance of optically-selected clus- 
ters of gal axies called the "maxBCG cluster catalog" 
(|Koester et al. 20071), to the 5-year WMAP+B AO+SN, 
and found ^ m^, < 0.35 and 0.52 eV (95% CL) for 
w — —1 and w ^ —1, respectively. iThomas et al.l 
()2010[) added the angular power spectra of photomet- 
rically selected samples of LRGs called "MegaZ" to the 
5-year WMAP-^BAO-H SN, and foun d to ,, < 0.325 eV 
(95% CL) for u; = -1. iWang et al.l ([Bol pointed out 
that the limit on ^ rrii, from galaxy clusters would im- 
prove significantly by not only using the abundance but 
also the power spectrum of clusters. 

In order to exploit the full information contained in 
the growth of structure, it is essential to understand the 
effects of massive neutrinos on the non-linear growth. 
All of the work to date (including WMAP+LRG+Ha 
presented above) included the effects of massive neu- 
trinos on the linear growth, while ignoring their non- 
linear effects. The widely-used phenomenological calcu- 
lation of the n on-linear matter p ower spectrum called 
the HALOFIT (|Smith et al.l l2003f ) has not been cali- 
brated for models with massive neutrinos. Consistent 
treatments of massive neutrinos in the non-linear struc- 
ture formati on us ing cosmological p erturbation theory 
JSaito et all [200l [2 009: Wong 2008; iLesgourgues et al.l 
[2009; Shoii & Komatsu 2009) and numerical simulations 
jI3randbyge ct al. 2008; Brandbyge & Hanncstad 2009) 
have just begun to be explored. More work along these 
lines would be necessary to exploit the information on 
the growth structure to constrain the mass of neutrinos. 

4.7. Relativistic Species 

How many relativistic species are there in the uni- 
verse after the matter-radiation equality epoch? We 
parametrize the relativistic degrees of freedom using 
the effective number of neutrino species, iVoff, given 
in equation ()23|) . This quantity can be written in 
terms of the matter density, Vlmh? , and the redshift of 
matter-radiation equa lity, Zoq, as (see equation (84) of 
IKomatsu et~alll2009a|) 



Zoq compared to the 5-year one. For models where A'cff 



AT^g = 3.04 + 7.44 



nmh"^ 3139 
0.1308 1 + z, 



- 1 



cq 



(53) 



(Here, ^mh'^ = 0.1308 and = 3138 are the 5-year 
maximum likelihood values from the simplest ACDM 
model.) This formula suggests that the variation in Ncs 
is given by 



5N^ 



2.45 



2.45- 



1 + Zo 



(54) 



The equality redshift is one of the direct observables from 
the temperature power spectrum. The WMAP data con- 
strain Zoq mainly from the ratio of the first peak to the 
third peak. As the 7-year temperature power spectrum 
has a better d etermination of the amplitude of the third 
peak ijLarson et al., ,2010i ). we expect a better limit on 



is different from 3.04, we find Zeq = 3145li^^ (68% CL) 



from the WMAP data onl3H, which is better than the 
5-year limit by more than 10% (see Table IH). 

However, the fractional error in ftmh^ is much larger, 
and thus we need to determine flmh^ using external 
data. The BAO data provide one constraint. We also 
find that Vl,nh^ and Hq are strongly correlated in the 
models with Ncs ^ 3.04 (see Figure El). Theref o re, an 
improved measurement of i/o from IRiess et al.l (|2009( ) 
would help reduce the error in Q.^h?', thereby reducing 
the error in N^s- The limit on Qmh^ from the 7-year 
WMAP+BAO+Hq combination is better than the 5-year 
" WMAP+B AO+SN+HSr' limit by 36%. 

We find that the WMAP+BAO+i?o limit on Ncs is 

TVeff = 4Mt°it (68% CL), 

while the WMAP+LRG+i7o limit is N^s = 
4.25l|^-^^ (68% CL), which are significantly bet- 
ter than the 5-year WMAP-t-BAO-hSN-j-ffST hmit, 
N^.ff = 4.4 ± 1. 5 (68% CL). 

IReid et all ()2010 b') added the maxB CG prior 
o-s(» m/0.25)° -^^ = 0.832 ± 0.033 (68% CL; Rozo et all 
mm . to the 5-year WMAP+BAO+SN+HST, and 
found A^cff = 3.5 ± 0.9 (68% CL). They also added the 
above prior to the 5-year version of WMAP-|-LRG-|-i/o, 
finding A^off = 3.77 ± 0.67 (68% CL). 

The constraint on A'off can also be interpreted as an 
upper bound on the energy density in primordial grav- 
itational waves with frequencies > 10~^^ Hz. Many 
cosmological mechanisms for the generation of stochas- 
tic gravitational waves exist, such as certain inflation- 
ary models, electroweak phase transitions, and cosmic 
strings. At low frequencies (10~^^ — 10~^® Hz), the 
background is constrained by the limit on tensor fluc- 
tuations described in Section H7T] Constraints at higher 
frequencies come from pulsa r timing measurements at 
^ 10~® Hz (jJenet et al.ll200(j ). recent data from the Laser 
Interferometer Gravitational Wave Observatory (LIGO) 
at 10 Hz (with limits of figw < 6.9 x lO''^ Abbott et all 
l2009f ). and at frequencies > lO"^'^ Hz from measure- 
ments of light-element abundances. A large gravitational 
wave energy density at nucleosynthesis would alter the 
predicted abundances, and observations impl y an upper 
bound of Qg^^h^ < 7.8 x lO^^ (Cyburt et al. 2001 ). 

The CMi5 provides a limit that reaches down to 
10~^^ Hz, corresponding to the comoving horizon at re- 
combination. The gravitational wave background within 
the horizon behaves as free-streaming masslcss particles, 
so affects the CMB and matter power spectra in the 
same way as massless neutrinos (S mith et al.ll200Q) . The 
density contributed by A^gw massless neutrino species is 
flgvjh'^ = 5.6 X 10~^Ng„. Constraints have been found 
using the WMAP 3-year data combined with additional 
cosmological probes by Smith et al. (2006), for both adi- 
abatic and homogeneous initial conditions for the ten- 
sor perturbations. With the current WMAP+BAO+i/o 
data combination, we define Ng^ — N^s — 3.04, and find 



•^^ For models with N^g 
(68% CL). 



3.04, we find Zcq 



3196^ 
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TABLE 8 

Improvements in N^a: 7-year versus 5-year 



Parameter 


Year 


WMAP only 


WMAP-FBAO+SN4 


-HST WMAP+BAO+Ho 


WAMP+LRG-l-ifo 




5-ycar 
7-ycar 




3240if, 


32091^^ 


3240 ± 90 




5-ycar 
7-ycar 


„ +U.044 
U-J-'O-0.041 


0.160 ± 0.025 


0.157 ± 0.016 


„ ,,.7+0.013 

'J- -"-0' -0.014 




5-ycar 
7-ycar 


> 2.3 (95% CL) 

> 2.7 (95% CL) 


4.4 ± 1.5 


4 34+0.86 


^■^^-0.80 



0.25 



0.20 



0.15 



0.10 





Ho [km/s/Mpc] 



N 



eff 



Fig. 9. — Constraint on the effective number of neutrino species, N^ff. (Left) Joint two-dimensional marginalized distribution (68% and 
95% CL), showing how a better determination of Hq improves a limit on VLmh^ ■ (Middle) A correlation between N^ff and Cl^h'^. The 
dashed line shows the line of correlation given by equation I I53I I. A better determination of Hq improves a limit on Q^h'^ which, in turn, 
improves a limit on A^cfl- (Right) One-dimensional marginalized distribution of Af^ff from WMAP-only and WMAP+HAO+Hq. The 68% 
interval from WMAP+BAO+Ho, N^s = 4.34l{5:i8' 

is consistent with the standard value, 3.04, which is shown by the vertical line. 



limits of 



iVgw < 2.85, 



< 1.60 X lO"'^ (95%CL) 



for adiabatic initial conditions, imposing an Ncs > 3.04 
prior. Adiabatic conditions might be expected if the 
gravitational waves were ge nerated by the appearance 
of cusps in cosmic string s (jPamour fc VilenkinI IWiol 
[2001 ISiemens et aT1[200ffl . For the WMAP+LRG-hi?o 
data, we find A^gw < 2.64, or flg^h'^ < 1.48 x 10~^ at 
95% CL. Given a particular string model, these bounds 
can be used to constrain the cosmic string ten sion (e.g., 
ISiemens et all 120071 : ICooeland fc Kibbld 120091) . 

4.8. Primordial Helium Abundance 

A change in the primordial helium abundan ce affects 
the s hape of the temperature power spectrum ()Hu et al.l 
I1995D . The most dominant effect is a suppression of the 
power spectrum at / > 500 due to an enhanced Silk 
damping effect. 

For a given mass density of baryons (protons and he- 
lium nuclei), the number density of electrons, Ug, can be 
related to the primordial helium abundance. When both 
hydrogen and helium were ionized, rig = {l~Yp/2)pi,/mp. 
However, most of the he lium recombines by z ~ 1800 (see 
iSwitzer fc Hiratal 120081 and references therein), much 
earlier than the photon decoupling epoch, z = 1090. As 
a result, the number density of free electrons at around 
the decoupling epoch is given by — (1 — Yp)pf,/mp oc 
(1 - Yp)nbh^ (IHu et all 119951) ■ The larger Yp is, the 
smaller Ue becomes. If the number of electrons is re- 
duced, photons can free-stream longer (the mean free 
path of photons, l/{aTne), gets larger), wiping out more 



temperature anisotropy. Therefore, a larger Yp results in 
a greater suppres sion of power on small angula r scales. 

[ Ichikawa et al.l ([2008) (also see llchikawa fc T akahashl 
120061 ) show that a 100% change in Yp changes the heights 
of the second, third, and forth peaks by 1%, 3%, and 
3%, respectively. Therefore, one expects that a com- 
bination of the WMAP dat a and small-scale CMB ex- 
perime nts such as ACBA R (jReichardt et al.l [20091 ) and 
QUaD (jBrown et al.ll2009l ) would be a powerful probe of 
the primordial helium abundance. 

In Figure [TUl we compare the WMAP, ACBAR, and 
QUaD data with the temperature power spectrum with 
the nominal value of the primordial helium abundance, 
Yp — 0.24 (pink line), and that with a tiny amount of 
helium, Yp = 0.01 (blue hne). There is too much power 
in the case of 1^ — 0.01, making it possible to detect the 
primordial helium effect using the CMB data alone. 

However, one must be careful about a potential degen- 
eracy between the effect of helium and those of the other 
cosmological parameters. First, as the number density of 
electrons is given by rig = (1 — Yp)nb cx (1 — Yp)^lbh'^, Yp 
and ilft/i^ may be correlated. Second, a scale-dependent 
suppression of power such as this may be cor related with 
the effect of tilt, Us (|Trotta k Hansenll2004l ). 

In the left panel of Figure [TTJ we show that ili,h^ and 
Yp are essentially uncorrelated; the baryon density is de- 
termined by the first-to-second peak ratio relative to the 
first-to-third peak ratio, which is now well measured by 
the WMAP data. Therefore, the current WMAP data 
allow rifc/i^ to be measured regardless of Yp. 

In the middle panel of Figure [HI we show that there 
is a slight positive correlation between Us and Ypi an 
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Fig. 10. — Primordial helium abundance and the temperature power spectrum. The data points are the same as those in Figure[7l The 
lower (pink) solid line (which is the same as the solid line in Figure (Tj shows the power spectrum with the nominal helium abundance, 
Yp = 0.24, while the upper (blue) solid line shows that with a tiny helium abundance, Yp = 0.01. The larger the helium abundance is, the 
smaller the number density of electrons during recombination becomes, which enhances the Silk damping of the power spectrum on small 
angular scales, I > 500. 
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Fig. 11. — Constraint on the primordial helium abundance, Yp. (Left) Joint two-dimensional marginalized distribution (68% and 95% 
CL), showing that Yp and O^h^ are essentially uncorrelated. (Middle) A slight correlation exists between Yp and Ug-. an enhanced Silk 
damping produced by a larger Yp can be partially canceled by a larger rig. (Right) One-dimensional marginalized distribution of Yp from 
WMAP-only and WMAP-I- ACBAR-|-QUaD. The 68% interval from WMAP-|- ACBAR-|-QUaD, Yp = 0.326 ± 0.075 is consistent with the 
nominal value, 0.24, which is shown by the vertical line. 



enhanced Silk damping produc ed by a larger 1^ can b e 
partially canceled by a larger Us (|Trotta fc Hansenll200l . 

We find a 95% CL upper limit of Fp < 0.51 from 
the WMAP data alone. When we add the ACBAR and 
QUaD data, we find a significant detection of the effect 
of primordial helium by more than 3a (see the right panel 
of Figure [Tl]) , 



99% CL lower limit is Yp > 0.11. This value is 
broadly consistent with the helium abundances estimated 
from observations of low-metallicity extragalactic ion- 
ized ( HII) regions, Y „ ~ 0.24 - 0.25 (iGr uenwald efal 



Yp = 0.326 ± 0.075 (68% CL). 



The 95% CL limit is 0.16 < Yp < 0.46. 



The 



200l llzotov fc Thu an 20 0l lOlive fc Sk iUman 200^, 
Pukueita & Kawasaki 2006r iPeimbert et al.li2007i ). See 
Steigman (2007) for a review. 

We can improve this limit by imposing an upper limit 
on Yp from these astrophysical measurements. As the 
helium is created by nuclear fusion in stars, the he- 
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lium abundances m easured from stars (e.g., Sun; see 
lAsplund et al.|[2009l for a recent review) and HII regions 
are, in general, larger than the primordial abundance. 
On the other hand, as we have just shown, the CMB 
data provide a lower limit on Yp. Even with a very con- 
servative hard prior, Yp < 0.3, we find 0.23 < Yp < 
0.3 (68% CLJ3- Therefore, a combination of the CMB 
and the solar constraints on Yp offers a new way for test- 
ing the predictions of theory of the big bang nucleosyn- 
thesis (BEN). For example, the BEN predicts that the 
helium abundance is related to the baryon-to-photon ra- 
tio, 77, and the number of additional neutrino species (or 
any other additional relativistic degrees of freedom) dur- 
in g the BEN epoc h, ANi, = — as (see equation (11) 
of lSteigmanl[2008l) 

Yp = 0.2485 + O.0016[(77io - 6) + 100(5 - 1)], (55) 



where S = y^l + (7/43)AiV^ ~ 1 + 0.081A7V^ and 
r]io = I0^°r] = 273.9(nh h^) = 6.19 ± 0.15 (68% CL; 
WMAP+BAO+Ho). (See ISimha fc SteigmanI HoOSl for 
more discussion on this method.) For AiV^ = 1, the he- 
lium abundance changes by AYp — 0.013, which is much 
smaller than our error bar, bu t is comparable to th e ex- 
pected error bar from Planck (jlchikawa et al.|[2008fl . 

There have been several attempts to measure Yr, 
from the CME data fTrotta & Hansen"2004'; 'Hucv ct al.' 
2004; Ichikawa^ k Takahashj .2006.: Jchikawa ct al. 2008; 
Dunklev et al.l \200W The previous best-limit is 1^ — 

0.25+° J°j^^^^J_a^_68% CL (95% CL), which was ob- 
tained bv llchikawa et al.l ()2008D from the WMAP 5-yea r 
data combined with ACEAR (iReichardt et al.l l2009l) 
EOOMER anC ("Jones e t al.ll2006l: iPiacentini et al.ll2006l: 
lMontroy~et al. 2006), and Cosmic Background Imager 
(CBL lSievers et al.. ,2007). Note that the likelihood func- 
tion of Yp is non-Gaussian, with a tail extending to 
Yp = 0; thus, the level of significance of detection was 
less than 3(T. 

5. CONSTRAINTS ON PROPERTIES OF DARK ENERGY 

In this section, we provide limits on the properties of 
dark energy, characterized by the equation of state pa- 
rameter, w. We first focus on constant (time indepen- 
dent) equation of state in a flat universe (Section 15.11) 
and a curved universe (Section l5.2p . We then constrain a 
time-dependent w given by w{a) — wp + Wq^l — a) , where 
a = 1/(1 -I- z) is the scale factor, in Section [5731 Next, 
we provide the 7-year "WMAP normalization prior" in 
Section [5.41 which is useful for constraining w (as well as 
the mass of neutrinos) from the growth of c osmic den- 
sity fluctuations. (See, e.g., [Vikhlinin et al.i r2009b. for 
an application of the 5-year normalization prior to the 
X-ray cluster abundance data.) In Section [531 we pro- 
vide the 7-year ''WMAP distance prior," which is useful 
for constraining a variety of time-dependent w models 
for which the Markov Chain Monte Carlo exploration of 
the parameter space may not be available. (See, e.g.. 

The upper limit is set by the hard prior. The 68% lower limit, 
Yp^min = 0.23, is found such that the integral of the posterior 
likelihood of Yp in yp_min < < 0.3 is 68% of the integral in 
<Yp < 0.3. Similarly, the 95% CL lower limit is Yp > 0.14 and 
the 99% CL lower limit is Yp > 0.065. 
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Fig. 12. — Joint two-dimensional marginalized constraint on the 
time-independent (constant) dark energy equation of state, w, and 
the curvature parameter, H^. The contours show the 68% and 
95% CL from WMAP+BAO+Hq (red), WMAP+BAO+Ho+DAt 
(black), and WMAP+BAO+SN (purple). 

ILi et al.l[2008t IWangil2008l 120091: Ivikhlinin et al.ll2009bL 

for applications of the 5-year distance prior.) 

We give a summary of our limits on dark energy pa- 
rameters in Table Uj 

5.1. Constant Equation of State: Flat Universe 

In a flat universe, fi^ = 0, an accurate determina- 
tion of Ho helps improve a limit on a consta nt equa- 
tion of state, w (" Spergel et all [2003HHuI [2005D . Using 
WMAP+BAO+Ho, we find 

w = -1.10 ±0.14 (68% CL), 

which improves to w = -1.08 ± 0.13 (68% CL) if 
we add the time-delay distan ce out to the lens system 
B1608-I-656 (jSuvu et al.[[20Tol see Section [MS]). These 
limits are independent of high-z Type la supernova data. 

The high-z supernova data provide the most strin- 
gent limit on w. Using WMAP+BAO-fSN, we find 
w = -0.980±0.053 (68% CL). The error does not include 
systematic errors in su pernovae, which are comparable 
to the statistical error (jKessler et al.|[2009l: [Hicken et al.l 
[2009bD : thus, the error in w from WMAP+BAO+S^ 
is about a half of that from WMAP+BAO+Hq or 

WMAP+BAO+Ho+DAt- 

The cluster abundance data are sensitive to w via the 
comoving volume element, angular dia meter distance, 
and g rowth of matter density fluctuations (jHaiman et al.l 
[2001f ). By combining th e cluster abundance da ta and 
the 5-year WMAP data, IVikhlinin et"all (|2009b[ ) found 
w = -1.08 ± 0.15 (stat) ± 0.02 5 (svst) (68% CL ) for a 
flat universe. By adding BAG of EiscnsteinelalJ (|2005[ ) 
and the supernova data of iDavis et al. {200"7t ). thev found 
w = -0.991 ± 0.045 (stat) ± 0.039 (syst) (68% CL). 
Thes e results using the cluster abundance data (also see 
Mant z~e"t al.[ [2010cf ) agree well with our corresponding 
WMAP-|-BAG-t-i7o and WMAP+BAG+SN hmits. 

5.2. Constant Equation of State: Curved Universe 



WMAP 7-year Cosmological Interpretation 



25 



. ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 _ 






wmap+Ho+Sn 




: ■ WMAP+BAOh-Ho+SN 




; □ WMAP+BAOh-H(,+Da,+SN 

" i 






-1.2 



-1.0 



-0.4 



Fig. 13. — Joint two-dimensional marginalized constraint on the 
linear evolution model of dark energy equation of state, ■w{a) = 
Wo + tUa(l — «)• The contours show the 68% and 95% CL 
from WMAP+Ho+ST<! (red), WMAP-I-BAO-I-Ho-I-SN (blue), and 
WMAP+BAO+Ho+DAt+ST<S (black), for a fiat universe. 

When flk ^ 0, limits on w significantly weaken, with 
a tail extending to large negative values of w, unless su- 
pernova data are added. 

In Figure [HI we show that WMAP+BAO+Hq 
allows for w < —2, which can be excluded 
by adding information on the time-delay distance. 
In both cases, the spatial curvature is well con- 
strained: we find fli, = -0.0125+!!-^^^v from 



hO.0060 
-0.0063 



0.0067 

'0 CL) from 



WMAP+BAO+Ho, and -0.0111 

WMAP+BAO+Ho+DAt, whose errors are comparable 
to that of the WMAP+BAO+Hq hmit on flk with w = 
-l,nk = -0.0023t;^:^^^g (68% CL; see Section ES]) . 

However, w is poorly constrained: we find w — 
-1.44 ± 0.27 from WMAP+BAO+Hq, and -1.40 ± 
0.25 (68% CL) from WMAP+BAO+Ho+DAt- 

Among the data combinations that do not use the in- 
formation on the growth of structure, the most powerful 
combination for constraining D,k and w simultaneously 
is a combination of the WMAP data, BAO (or D^t), 
and supernovae, as WMAP+BAO (or Dai) primarily 
constrains ftk, and WMAP+SN primarily constrains w. 
With WMAP+BAO+SN, we find w = -0.9991:o°qII and 

= -0.0057t[!:|][!^^ (68% CL). Note that the error 
in the curvature is essentially the same as that from 
WMAP+BAO+Hq, while the error in it; is ~ 4 times 

s maller. 

iVikhlinin et al.l (|2009b( ) combined their cluster abun- 
dance data with the 5-year WMAP-t-BAO-f SN to find 
w = —1-03 lb 0.0 6 (68% CL) for a curved universe. 
iReid et ahl (|2010aD combined their LRG power spectrum 
with the 5-year WMAP data and the Union supernova 
data to find w = -0.99 ± 0.11 and = -0.0109 ± 
0.0088 (68% CL). These results are in good agreement 
with our 7-year WMAP+BAO+SN limit. 

5.3. Time- dependent Equation of State 

As for a time-dependent equation of state, we shall find 
constraints on the present-day value of the equation of 



state and its derivative using a linear f orm, w(a) = wq + 
Wa{l-a) (IChevallier fc Polarskil[200lt ILindei<[200l . We 
assume a flat universe, flk = 0. (For recent limits on wla) 
with fifc 7^ 0, see Wang 200^ and references therein.) 
While we have constrained this model using the WMAP 
di stance prior i n the 5-year analysis (see Section 5.4.2 
of iKomatsu et a l. 2009a), in the 7-year analysis we shall 
present the full Markov Chain Monte Carlo exploration 
of this model. 

For a time-dependent equation of state, one must be 
careful about the treatment of perturbations in dark en- 
ergy when w crosses —1. We use the "parametrized post- 
Friedmann" (PPF) approach, implemented in the CAMB 
code following Fan g et al.l (|2008[ )FI 

In Figure [131 "we show the 7-year con- 
straints on Wq and Wa from WMAP-|- _ffo+SN 
(red), WMAP+BAO+i7o+SN (blue), and 
WMAP-t-BAO-t-iJo+I^At+SN (black). The angular 
diameter distances measured from BAO and Da* help 
exclude models with large negative values of Wa- We find 
that the current data are consistent with a cosmological 
constant, even when w is allowed to depend on time. 
However, a large range of values of {wo,Wa) are still 
allowed by the data: we find 



Wq 



-0.93 ±0.13 and Wa 



OAlt°Pi (68% CL), 



from WA/TAP+BAO+i/o+SN. When the time-delay dis- 
tance information is added, the limits improve to wq = 
- 0.93 ± 0.12 and Wg = - 0.38+i^;^^ (68% CL). 

IVikhlinin et al.l (|2009b[ ) combined their cluster abun- 
dance data with the 5-year WMAP+BAO+SN to find a 
limit on a linear combination of the parameters, Wa + 
3.64(1 + Wo) = 0.05 ± 0.17 (68% CL). Our data combi- 
nation is sensitive to a different linear combination: we 
find Wa + 5.14(1 + Wo) = -0.05 + 0.32 (68% CL) for the 
7-year WMAP+BAO+i/o+SN combination. 

The current data are consistent with a flat universe 
dominated by a cosmological constant. 

5.4. WMAP Normalization Prior 

The growth of cosmological density fluctuations is 
a powerful probe of dark energy, modifled gravity, 
and massive neutrinos. The WMAP data provide a 
useful normalization of the cosmological perturbation 
at the decoupling epoch, z = 1090. By compar- 
ing this normalization with the amplitude of matter 
density fluctuations in a low redshift universe, one 
may distingu ish between dark energy and modi 
fied gr avitv (llshak et al.l 120061: iKovama fc Maarten; 
'200B- lAmarzguioui et al.l '20061: 'Pore et all 1200 
Lindcr & Cahn 2007: Upadhvc 2007; Z hang et all 
20071: ,Y amamoto ct al. 2007; Chiba & Takahashi"2007|; 
Bean et al. 2007; Hu & Sawicki 2007; Song ct al. 200l 
Starobinskv 2007; Daniel ct al. 2008; Jain & Zhan_g 
1)08; Bertschingcr fc ZuMn 2008: . Amin et al,, 2008; .Hu 
200^ ) and determine the mass o f neutrinos (iHu et al 
19981 iLesgourgues fc Pastoiil2006D 



11- 

1 



In Section 5.5 of IKomatsu et all ()2009aD . we provided 
a "WMAP normalization prior," which is a constraint 



IZhao et al.l (|2P05) used a multi-scalar-field model to treat ui 
crossing —1. The constraints on wq and Wa have bee n obtained 
using this model and the previous years of WMAP data IjXia et al.l 
200 6. .2008a; Zhao ct a l. 2007,) . 
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TABLE 9 

WMAP DISTANCE PRIORS OBTAINED FROM 
THE WMAP 7- YEAR FIT TO MODELS WITH 
SPATIAL CURVATURE AND DARK ENERGY. 
The CORRELATION COEFFICIENTS ARE: 

^ 0.7357. 



I A, 





7-ycar ML" 


7-ycar Mcan^ 


Error, a 


Ia 


302.09 


302.69 


0.76 


R 


1.725 


1.726 


0.018 


2* 


1091.3 


1091.36 


0.91 



" Maximum likelihood values (recom- 
mended) . 

^ Mean of the likelihood. 



TABLE 10 
Inverse cov.ariance matrix 

FOR the WMAP DISTANCE 
PRIORS 





Ia R 




Ia 


2.305 29.698 


-1.333 


R 


6825.270 


-113.180 


.2* 




3.414 



on the power s pectrum of curvature perturbation, A^. 
iVikhlinin et al.i (.2009b,) combined this with the number 
density of clusters of galaxies to constrain the dark en- 
ergy equation of state, w, and the amplitude of matter 
density fluctuations, erg- 
The matter density fluctuation in Fourier space, (5m, k, 

-7^kT(fc)-D(fc,^), 



2/c^ 



5//ga, 



is related to T^-k as 5m,\i{z) 

where D(k, z) and T{k) are the linear growth rate and the 
matter transfer function normalized such that T{k) — > 1 
as A; — >■ 0, and (1 -I- z)D{k, z) — 1 as fc — >■ dur- 
ing the matter era, respectively. Ignoring the mass of 
neutrinos and modifications to gravity, one can obtain 
the growth rate by s olving a sing le diffe rential equ ation 
(IWang fc Steinhard t 1998; Linder fc Jenkinsll2003D FI 
The 7- year normalization prior is 



A'^{kwMAp) 



= (2.208 ±0.078) X 10"^ (68% CL), 



where kwMAP — 0.027 Mpc . For comparison, the 5- 
year normalization prior was A|(0.02 Mpc"^) = (2.21± 
0.09) X 10"^. This normalization prior is valid for mod- 
els with fifc 7^ 0, w 7^ —1, or > 0. However, 
these normalizations cannot be used for the models that 
have the tensor modes, r > 0, or the running index, 
dus/dlnk 7^ 0. 

5.5. WMAP Distance Prior 

The temperature power spectrum of CMB is sensi- 
tive to the physics at the decoupling epoch, z = 1090, 
as well as the physics between now and the decoupling 
epoch. The former primarily affects the amplitude of 
acoustic peaks, i.e., the ratios of the peak heights and the 
Silk damping. The latter changes the locations of peaks 
via the angular diameter distance out to the decoupling 
epoch. One can quantify this by (i) the "acoustic scale" , 



3^ See, e.g., equation (79) of lKomatsu et aLI Il2009al) . Note that 
there is a typo in that equation: w^f[[a) needs to be replaced by 
w{a). 



Ia = (1 + z,) 



rs{z:t) 



(56) 



where is the redsh ift of de coupling, for w hich we use 
the fitting formula of lHu &: Sugi vamal (119961), as w ell as 
by (ii) the "shift parameter." R "(jBond et al.lll997[ ). 



R 



(1 + Z^)DAiZ:t). 



(57) 



These two parameters capture most of the con- 
straining po wer of the WMAP data fo r dark e nerg; 
properties (iWang fc Mukheried [20071: iWrig ht' '200. 



ergy 
flOa 
iorril 



Elgar0v fc Multamakil l2007t iCorasaniti fc Melchiorr: 
20081) ■ with one important difference. The distance 
prior does not capture the information on the growth 
of structure probed by the late-time ISW effect. As 
a result, the dark energy constraints derived from the 
distance prior are similar to, b ut weaker than , those 
derived from the full analysis ([Komatsu et al.l l2009al : 
ILi et al.l[2008l) . 

We must understand the limitation of this method. 
Namely, the distance prior is applicable only when the 
model in question is based on: 

1. The standard Friedmann-Lemaitre- Robertson- 
Walker universe with matter, radiation, dark 
energy, as well as spatial curvature, 

2. Neutrinos with the effective number of neutri- 
nos equal to 3.04, and the minimal mass (niu ~ 
0.05 eV), and 

3. Nearly power-law primordial power spectrum of 
curvature perturbations, \dns/dhik\ ^ 0.01, negli- 
gible primordial gravitational waves relative to the 
curvature perturbations, r <^ 0.1, and negligible 
entropy fluctuations relative to the curvature per- 
turbations, a ^ 0.1. 

In Table 1^ and [TUl we provide the 7-year distance prior. 
The errors in I a, R, and have improved from the 5-year 
values by 12%, 5%, and 2%, respectively. To compute the 
likelihood, use 



21nL 



{xi - d.j){C ^)ij{xj - dj), 



(58) 



where Xi = {I a, R, z^) is the values predicted by a model 



in question, d, = {ij^^^ , rWMAP^ ^wmap^ jg ^^^^ 



given in Table [HI and C^^ is the inverse covariance 
matrix given hi Table [TOl Also see Section 5.4.1 of 
IKomatsu et"all ()2009aD for more information. 



6. PRIMORDIAL NON-GAUSSIANITY 

6.1. Motivation and Background 

During the period of cosmic inflation (Starobinskii] 
, 1979: Starobinsky 1982; Gut h 1981; Sato 1981; Find 
Il982l: lAlbrecht fc Steinhardtl Il98l) . quantum fluctua- 



tions were generated and became the seed s for the cosmic 
struc t ures that we obse r ve today ( Mukha nov fc Chibisov 



1981 1 : iHawking' l98^; 'Sta robinskvl |198ljGuth & PJ 
1982t iBardeen et al.. .198a ). (Also see iLindc .199C 
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Mukhanov et~all [1991 JLiddle fc LvthI [2000l [200l 
Bassett at al.ll200d: iLindiboOSl for reviews.) 



Inflation predicts that the statistical distribution of 
primordial fluctuations is nearly a Gaussian distribution 
with random phases. Measuring deviations from a Gaus- 
sian distribution, i.e., non- Gaussian correlations in pri- 
mordial fluctuations, is a powerful test of inflation, as 
how precisely the distribution is (non-)Ga ussian depends 
on the detailed physics of inflation (see iBartolo et al.l 
[2OOI iKomats u et"aLll2009U for reviews). 

In this paper, we constrain the amplitude of non- 
Gaussian correlations using the angular bispectrum of 
CMB temperature anisotropy, the har monic transform 
of the 3-point correlation function (see iKomatsul I2001L 
for a review). The observed angular bispectrum is re- 
lated to the 3-dimensional bispectrum of primordial cur- 
vature perturbations, (CkiCk2Ck3) = (27r)'^(5^(ki -I- k2 -f 
k3)_B^(fci, ^2, fes). In the linear order, the primordial cur- 
vature pertur bation is related to Bardeen's curvature 
perturbation ()Bardeenl ll98Clf ) in the matter-dominated 
era, by C = ("e.g.. IKodama fc Sasakil IT981 . The 
CMB temperature anisotropy in the Sachs-Wolfe limit 
( |Sachs fc W olfc 1967) is given by AT/T = -i$ = -iC- 
We write the bispectrum of $ as 

($ki$k.$k3) = (2^)3j^(ki+k2 + k3)F(fci,fc2,fc3). (59) 

We shall explore 3 different shapes of the primordial 
bispectrum: "local," "equilateral," and "orthogonal." 
They are defined as follows: 

1. Local fo rm. The local f o rm bispectrum is 
given bv (IGangui et all [1991 IVerde et al.l [2OOOI: 
IKomatsu fc Spergelll2001[) 



-flocal(A;i,fc2,fc3) 

= 2/]^2"'[^*(fci)^*(fc2) 



NL 



(2 perm. 



(60) 



where P$ = A/fc'*^"= is the power spectrum of 
$ with a normalization factor A. This form is 
called the local form, as this bispectrum can arise 
from the curvature perturbation in the form of 
$ = $L -I- where both sides are evaluated 

at the same location in space ($l is a linear Gaus- 
sian fluctuation) Equation (|60| peaks at the so- 
called "sq ueezed" triangle for which k-^ <^ k2 ~ fci 
(jBabich et al . 2004). In this limit, we obtain 



Fiocai(fci, fci, fca ^ 0) = 4/]^r'-P*(fci)^*(fc: 



(61) 



^ However, # = #i -|- /]^|f "I"!, is not the only way to pro- 
duce this type of bispectrum. One can also produce this form from 
multi-scalar field inflation models where scalar field fluctuations 
are nearly scale invariant (Lvth & Rodriguez 2005); multi-scalar 
models cal led "curvaton" scenarios (Lindc & Mukhanov 1993; 
ILvth et al.| [2003^: multi-field models in which one field modulates 
the d ecay rate of inflaton field tPvali et al. .2004a. . b,; Zaldarriagal 
120041) ; multi-field models in which a violent production of parti- 
cles and non-linear re heating, called "preheat ing," occur due to 
par ametric resonances fEnqvist et al. 2005; Jokmcn fc Mazumdarl 
12006 : Chambers & Raiantic 2008; Bond et al. 2009); mode ls in 
which the universe contracts first and then bounces (see ILehnersI 
120081 . for a review). 



How large is from inflation? The ear- 

lier calculations showed that /j^^'^' from single- 
field slow-roll inflation would be of orde r the 
slow-roll parameter, e ~ 10~^ (I^^Q Pgj L'^ Bondl 
[19901 : iFalk et a l.' 1993; Ga ngui et'aiTll994D. More 
recentlv. iMaldacena (20031) and lAcquaviva et all 
(200|) found that the coefladent of P^{ki)P^{k2,) 
from the simplest single-field slow-roll inflation 
with the canonical kinetic term in the squeezed 
limit is given by 



^^local(fcl,fci,fc3 ^ 0) = -(1 - n,)P*(fci)P<E,(fc3). (62) 



Comparing this result with the form predicted by 
the f§f^ model, one obtains f^f" = (5/12) (1 - 
n,), which gives fST^ = 0.015 for = 0.963. 



2. Equilateral for m. The e quilateral form bispec- 
trum is given by ( Creminelli et al.ii2006i ) 

-Foquii(fci, ^2,^3) = ^A^TnT"" 

1 1 1 



"-1 "-2 



J 4— ris J 4— ris 
K2 Kg 



7 4 — ris J 4 — Us 
Kg Ki 



(fclfc2fc3)2(4-" = )/3 

-|-(5 perm.)]} . 



1 



(4-n,)/3,2(4-n,)/3,4- 



(63) 



This function approximates the bispectrum forms 
that arise from a class of inflation models in 
which scalar flelds have non-canonical kinetic 
terms. One example is the so-called Dirac - Born- 
Infeld (DBI) inflation (jSilverstein fc Tond 120041: 



lAlishahiha et al.|[200l) , which gives / 



equil 
NL 



oc 



in the limit of <C 1, where Cs is the effec- 
tive sound speed at which scalar field fiuctua- 
tions propagate. There are various other mod- 
els th a t can produce f ^?"^' j Arkani -HarnQd et al.l 



2OOI ISeerv fc LidseV ^2005; Ch en eTalJ 



Cheung et aiTf200& : ,Li et al.. 200 8). The local and 



equilateral forms are nearly orthogonal to each 
other, which means that both can be measured 
nearly independently. 

3. Orthogonal form. The orthogonal form, which 
is constructed such that it is nearly orthogonal to 
both the local and e quilateral forms, is given by 
(jSenatore et al.ll20ldl ) 



-Forthog(fcl, ^2,^3) 

3 

jL4-ns 1.4-ns 
"-1 "-2 



tlioe 



{kik2k3)^(^-"^)/^ 

h(5 perm.)]} . 



(4-n,)/3,2(4-n,)/3,4- 



(64) 



This form approximates the forms that arise from a 
linear combination of higher-derivative scalar-field 
interaction terms, each of which yields forms simi- 
lar to the equilateral shape. iSenatore et al.. (,20ldl ) 
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TABLE 11 

Estimates" and the corresponding 68% intervals of the 

PRIMORDIAL NON-GaUSSIANITY PARAMETERS (/Jj^"', 
/iVi'"'*) ^'^^ ™^ POINT SOURCE BISPECTRUM AMPLITUDE, b^rc (iN 
UNITS OF 10~^ llK^ sr^), FROM THE WMAP 7-YEAR TEMPERATURE 
MAPS 



Band 


Foreground^ 


i-local /-cquil 
JNL JNL 


i-orthog 
^ NL 




V+W 


Raw 


59 ± 21 33 ± 140 


-199 ± 104 


N/A 


v+w 


Clean 


42 ± 21 29 ± 140 


-198 ± 104 


N/A 


V+W 


Marg." 


32 ± 21 26 ± 140 


-202 ± 104 


-0.08 ± 0.12 


V 


Marg. 


43 ± 24 64 ± 150 


-98 ± 115 


0.32 ± 0.23 


w 


Marg. 


39 ± 24 36 ± 154 


-257 ± 117 


-0.13 ± 0.19 



^ The values quoted for "V+W" and "Marg." arc our best 
estimates from the WMAP 7-year data. In all cases, the full- 
resolution temperature maps at HEALPix A/^sidc — 1024 are 
used. 

^ In all cases, the KQ75y7 mask is used. 

^ "Marg." means that the foreground templates (synchrotron, 
free-free, and dust) have been marginalized over. When the fore- 
ground templates are marginalized over, the raw and clean maps 
yield the same /nl values. 

found that, using t he "effective field th eory of in- 
flation" approach (jCheung et al.|[2008l ). a certain 
linear combination of similarly equilateral shapes 
can yield a distinct shape which is orthogonal to 
both the local and equilateral forms. 

Note that these are not the most general forms one 
can write down, and there are other forms which 
would p robe different aspect s of the physics of in- 
flation (iMoss k XionE l2 007t iMoss fc GrahamI 120071: 
Chen et al.ll2'00 7: Holman fc Tolleyll2008HChen fc Wa^l 



20101: IChen fc Wang.,2010,) . 



Of these for ms, the local form bispect r um ha s special 
significance. iCreminelli fc Zaldarriagal (|2004l ) showed 
that not only models with the canonical kinetic term, 
but all single-infiation models predict the bispectrum 
in the squeezed limit given by Eq. regardless of 

the form of potential, kinetic term, slow-roh, or initial 
vacuum state falso seelSeerv fc Lidsevll2005l : iChen et all 
[20071: ICheung et al.ll2008n ~ This means that a convincing 
detection of fj^f^^ would rule out all single-field inflation 



models. 



6.2. Analysis Method and Results 



The first hmhon^£]21!i 

was obtained from the COBE 
4-year data (jBennett et al.l 119961 ) by iKomatsu et alJ 
(pool , using the angular bispectrum. The limit 
was improved by an order of magnitude when the 
WMAP first ye ar data were used to constrain f]^'j^^ 
(|Komatsu et all [2003, ). Since then the limits have 
improved steadily as WMAP collect more years of 
data and the b ispectrum method for estimatin g f^r'^^ 
has i mproved (IKomatsu et al.l [20051: ICr eminel li et alJ 
2006', '2007'; 'Soergel et al."200r; 'Yadav & Waiidelt||2008!; 
Komatsu et al. 2009a; Smith ct al. 2009) 

For references to other methods for estimating f]S'}^^ , which do 
not use the bispectrum directly, see Section 3.5 of IKomatsu et aT\ 
(|2009al ) . Recently, the "skewness power spectrum" has been pro- 
posed as a new way to measure and other non-Gaussian 
components such as the s e condary ani sotropics and point s ources 
jMunshi & Heavens 20TO|; ISmidt et al. 2009; Munshi ct all l2009i : 
[Calabrcsc ct al. 2010). In the lim it t hat noise is uniform, their 
estimator is equivalent to that of IK omatsu et al.l (|2005l ) , which 
also allows for simultaneous estimations of multiple sources of non- 



In this paper, we shall adopt the optimal estimator 
(developed bypB abich 2005; Crc minelli et al.|[2ml [20071: 
iSmith fc Zaldarriagai i2006i: i Yadav et al.l I2008D . which 
builds on and significantly ir nproves the orig i nal bi spec- 
trum estimator proposed by 'Koma tsu et aLl ()2005| ). es- 
pecially when the spatial distribution of instrumental 
noise is not uniform. Fo r deta. ils of the method, see Ap- 
pendix A of lSmith et all ([20091) for f°l^\ and Section 4.1 

of lSenatore eHdl ([20Tfll ) for fj^f and /"f To con- 
struct the optimal estimators, we need to specify the cos- 
mological parameters. We use the 5-year ACDM param- 
eters from WMAP-fBAO-hSN, for which = 0.96. 

We also constrain the bispectrum due to residual (un- 
resolved) point sources, bsrc- The optimal estimator for 
bs rc is constructed by replacing aim/Ci in equation (A24) 
of lKomatsu et al.l (|2009a) with {C~^a)im, and using their 
equations (A17) and (A5). The matrix is compute d 
by the multigrid-based algorithm ofl Smith et al.l (|2007D . 

We use the V- and W-band maps at the HEALPix res- 
olution iVsidc = 1024. As the optimal estimator weights 
the data optimally at all multipoles, we no longer need 
to choose the maximum multipole used in the analysis, 
i.e., we use all the data. We use both the raw maps (be- 
fore cleaning foreground) and foreground-reduced (clean) 
maps to quantify the foreground contamination of /nl 
parameters. For all cases, we find the best limits on 
/nl parameters by combining the V- and W-band maps, 
and marginalizing over the sy nchrotron, free -free, and 
dust foreground templates (G old et al.l l2010|). As for 



the m ask, we always use the KQ75y7 mask ( Gold et al.l 
[20T0I) . 

In Table 1111 we summarize our results: 

1. Local form results. The 7- year best estimate of 
is 



f local 
JNL 



xlocal 
JNL 



32 ±21 (68% CL). 



The 95% limit is -10 < /j^f' < 74. When 
the raw maps are used, we find f]^j^^ = 59 ± 
21 (68% GL). When the clean maps are used, but 
foreground templates are not marginalized over, 
we find fj^l""^ = 42 ± 21 (68% CL). These resuhs 
(in particular the clean-map versus the foreground 
marginalized) indicate that the foreground emis- 
sion makes a difference at the level of Afj^j^^ ~ 
100 We find that the V-^W result is lower than 
the V-band or W-band results. This is possible, 
as the V-l-W result contains contributions from the 
cross-correlations of V and W such as (WW) and 
(VWW). 

2. Equilateral form results. The 7-year best esti- 



mate of f^l'^ 

j-equil 
JNL 



26 ± 140 (68% CL). 



Gaussianity (see Appendix A of lKomatsu et aI1l2009j ). The skew- 
ness power spectrum method provides a means to visualize the 
shape of various bispectra as a function of multipoles. 

The effect of the foreground marginalization depends on 
an estimator. Using the needlet bispectrum, Cabella et al. 
ICabella et aP l(20p) found f^°'ff' = 35 ± 42 and 38 ± 47 (68% CL) 
with and without the foreground marginalization, respectively. 
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The 95% Umit is -214 < f^^^ < 266. For f^f, 
the foreground marginahzation does not shift the 



central values very much, Af\ 



cquil 

NL 



This 



makes sense, as the equilateral bispectrum does not 
couple small-scale modes to very large-scale modes 
^ ^ lOj which are sensitive to the foreground emis- 
sion. On the other hand, the local form bispectrum 
is dominated by the squeezed triangles, which do 
couple large and small scales modes. 

3. Orthogonal form results. The 7-year best esti- 

rthoff 



mate of /' 



NL 



IS 



^orthog ^ _202 ± 104 (68% CL). 



rthoe 



The 95% limit is -410 < /^vl 
ground marginahzation has little effect, A/^^ 
-4. 



< 6. The fore- 

orthog 



As for the point-source bispectru m, we do not d etect 
bsrc in V, W, or V+W. In KomatsujeTaD (|2009al ). we 
estimated that the residual sources could bias by 
a small positive amount, and applied corrections using 
Monte Carlo simulations. In this paper, we do not at- 
tempt to make such corrections, but we note that sources 
cou ld give A/j\°?^' 2^ 2 (note that the simulations used by 
iKomatsu et al.l (j2609al) likely overestimated the effect of 
sources by a facto r of two) . As the estim ator has changed 
from that used bv IKomatsu et al.] (|2009al) . extrapolating 
the previous results is not trivial. Source corrections to 

/^l'' fNL^°^ could be larger (jKomatsu et al. 2009a), 
but we have not estimated the magnitude of the effect 
for the 7-year data. 

We used the linear perturbation theory to calculate 
the angular bispectrum of primordial non-Gaussianity 

(Komatsu & SDcrgcl 2001). Second-order effects 

(IPvne fc C arroll 1996; MoUcrach & Matarresd 119971: 
iBartolo et al. 2006 , .2007^ ,Pitrou .2009ayl^are expected 
to giv e /i??'^' 1 (iNitta et al.l 120091: iSenatore et al 
2009allbl: iKhatri fc Wandeltl l2009al lbl: iBoubekeur et al 



2009r iPitrou et al ■I I2008n and are negligible given the 



noise level of the WMAP 7-year data. 

Among various sources of secondary non-Gaussianities 
which might contaminate measurements of primor- 
dial non-Gaussianity (in particular f]^'j^^), a coupling 
between the ISW effect and the weak gravitational 
lensing is the most dominant sour ce of confusion 
for fj^f"^ dGoldberg & SDcrEcl 199 91: I Verde fc Spergell 
2002; Smith fc Zaldarriaga 2006; Sc rra fc Cooravl 120081: 
Hanson et al,,2009 : .MangiUi fc Verddl2009[ ). While this 



contribution is expected to be detectable and bias the 
measurement of /j^^**' for Planck, it is expected to be 
negligible for WMAP: using the method of Hanso n et al.l 
(j2009|), we estimate that the expected signal-to- noise ra- 
tio of this term in the WMAP 7-year data is about 0.8. 
We also estimate that this term can give a po- 

tential positive bias of A/]^|f' ~ 2.7. iCalabrese et al.l 
(Hq IO) used t he ske wness power spectrum method of 
iMunshi et al.l (|2009n to search for this term in the 
WMAP 5-year data and found a null result. If we sub- 
tract Af]^'j^^ estimated above (for the residual source 
and the ISW-lensing coupling) from the measured value, 
A/Jvzf' becomes more consistent with zero. 



From these results, we conclude that the WMAP 7- 
year data are consistent with Gaussian primordial fluc- 
tuations to within 95% CL. When combined with the 
hmit on 1 from SDSS, -29 < f^f"^ < 70 (95% CL 
ISIosar et al.ll2008l ). we find -5 < /]^f ' < 59 (95% CL). 

7. SUNYAEV-ZEL'DOVICH EFFECT 

We review the basics of the SZ effect in Section 17.11 
In Section 17.21 we shall test our optimal estimator for 
extracting the SZ signal from the WMAP data using the 
brightest SZ source on the sky: the Coma cluster. We 
also present an improved measurement of the SZ effect 
toward the Coma cluster (3.6cr). 

The most significant result from Section 17731 is the dis- 
covery of the thermal/dynamical effect of clusters on 
the SZ effect. We shall present the measurements of 
the SZ effects toward nearby ( z < 0.09) galaxy clus - 
ters in Vikhlinin et al.'s sample ([Vikhlinin et al.|[2009aD . 
which were used to in fer the cosmological parameters 
(jVikhlinin eralll2009bl ) . We then compare the measured 
SZ flux to the expected flux from the X-ray data on the 
individual clusters, finding a good agreement. Signifi- 
cance of detection (from merely 11 clusters, excluding 
Coma) is 6.5(t. By dividing the sample into cooling-flow 
and non-cooling-flow clusters (or relaxed and non-relaxed 
clusters), wc find a significant difference in the SZ effect 
between these sub-samples. 

In Section 17741 we shall report a significant (^ 8a) sta- 
tistical detection of the SZ effect at hundreds of positions 
of the known clusters. We then compare the measured 
SZ flux to theoretical models as well as to an X-ray- 
calibrated empirical model, and discuss implications of 
our measurement, especially a recent measurement of the 
lower-than-theoretically-expected SZ power spectrum by 
the SPT collaboration. 

Note that the analyses presented in Section 17.31 and 
17.41 are similar but different in one important aspect: 
the former uses a handful (29) of clusters with well- 
measured Chandra X-ray data, while the latter uses hun- 
dreds of clusters without detailed X-ray data. Therefore, 
while the latter results have smaller statistical errors (and 
much larger systematic errors), the former results have 
much smaller systematic errors (and larger statistical er- 
rors). 

7.1. Motivation and Background 

When CMB photons encounter hot electrons in clus- 
ters of galaxies, the temperature of CMB changes 
due to the inverse Compton scattering by these elec- 
trons . This effect, known as the thermal SZ ef- 
fect (jZel'dovich fc SunvaevI Il969l : iSunvaev fc Zel'dovichl 
[Ml), is a source of significant additio nal (secondary) 
anisotropics in the microwave sky (see iRephaelil [l995l: 
lBirkinshawlll999l : iCarlstrom et al.ll2002l for reviews). 

The temperature change due to the SZ effect in units 
of thermodynamic temperature, ATgz, depends on fre- 
quency, and is given by (for a spherically symmetric 
distribution of gas): 



ATsz(0) 



cmb 



(7X 

m^c^ 



dl P, 



^/p + ¥D\y (65) 



where 9 is the angular distance from the center of a clus- 
ter of galaxies on the sky, Da the proper (not comoving) 
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angular diameter distance to the cluster center, I the ra- 
dial coordinates from the cluster center along the line of 
sight, Pe{r) the electron pressure profile, ctt the Thom- 
son cross section, rUe the electron mass, c the speed of 
light, and the spectral function given by 



1 r 



= xcoth (^1^ - 4, 



(66) 



where x = hv j {kgTcaih) — i^/(56.78 GHz) for Tcmb = 
2.725 K. In the Rayleigh- Jeans limit, — 0, one finds 
g^ ^ -2. At the WMAP frequencies, g^ = -1.97, 
-1.94, -1.91, -1.81, and -1.56 at 23, 33, 41, 61, and 
94 GHz, respectively. The integration boundary, lout^ 
will be given later. 

The thermal SZ effect (when relativistic corrections are 
ignored) vanishes at ~ 217 GHz. One then finds g„ > 
at higher frequencies; thus, the thermal SZ effect pro- 
duces a temperature decrement aXv < 217 GHz, vanishes 
at 217 GHz, and produces a temperature increment at 
V > 217 GHz. 

The angular power spectrum of temperature 
anisotropy caused by the SZ effect is sensitive to both the 
gas d is tribution in clust ers (Atrio-Barandcla & Miicket 
Il999t iKomatsu &: Kitavamal l1999) and the am- 

plitude of matter density fluctuations, i.e ., cts 

(IKomatsu fc Kitavamal Il999[ IKomatsu fc SeliakI 120021 : 
iBond et all 120051 ). While we have not detected the SZ 
power spectrum in the WMAP data, we have detected 
the SZ signal from the Co ma cluster (Ab ell 1656) in the 
1-yea r ()Bennett et al.ll2003ci) and 3-year (jHinshaw et al.l 
120071 ) data. 

We have also made a statistical detection of the 
SZ effect by cross-correlating the WMAP data 
with the locations of known clusters in the X-ray 
Brightest Abell- type Cluster QCBAC; lEbeling et al " 
1996) catalog (jBennett et al.l l2003cl: iHinshaw et"aT 
2007i ). In addition, there have been a number of 
statistical detections of the SZ effect reported by 
many groups using various methods (Fosalba et al. 
2003; Hcrnandcz-Montcagudo & Rubino-Martin 2004; 
Hernandcz-Montcagudo ct al. 2004; Mv ers et al.l 120041: 
Afshordi et al. 2005; .Lieu et al., ,2006: Bi elbv & Shanks 
20iT7t lAfshordi et al.l 120071 : 



Atrio-Barandcla et al.l 



2008t iKashhnskv et all 120081 : [DiegofcPartridgeil20& 
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limsKy 



7.2. Coma Cluster 

The Coma cluster (Abell 1656) is a nearby {z = 
0.0231) massive cluster located near the north Galac- 
tic pole {I, 6)=(56.75°, 88.05°). The angular diame- 
ter distance to Coma, calculated from z = 0.0231 and 
(nm^A) = (0.277,0.723), is Da = 67 h^'^ Mpc; thus, 
10 arcmin on the sky corresponds to the physical distance 
of 0.195 h-^ Mpc at the redshift of Coma. 

To extract the SZ signal from the WMAP tempera- 
ture map, we use the optimal method described in Ap- 
pendix [C] we write down the likelihood function that 
contains CMB, noise, and the SZ effect, and marginalize 
it over CMB. From the resulting likelihood function for 
the SZ effect, which is given by equation (|C7p . we find 
the optimal estimator for the SZ effect in a given angular 
bin a, Pa, as 
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Fig. 14. — Angular radial profile of the SZ effect toward the 
Coma cluster, in units of the Rayleigh-Jeans (RJ) temperature 
(^K). While the V- (green) and W-band (blue) measurements are 
contaminated by the CMB fluctuations around Coma, our optimal 
estimator can separate the SZ effect and CMB when the V- and W- 
band measurements are combined (red). The solid line shows the 
best-fitting spherical /3 model with the core radius of 9c = 10.5 ar- 
cmin and (9 = 0.75. The best-fitting central temperature decrement 
(fit to a /3 model) is Tsz,Rj(0) = -377 ± 105 ^K. Note that 10 ar- 
cmin corresponds to the physical distance of 0.195 Mpc at the 
location of Coma. The radius within which the mean overdensity 
is 500 times the critical density of the universe, rsoo, corresponds 
to about 50 arcmin. 

where the repeated symbols are summed. Here, di,p is the 
measured temperature at a pixel p in a frequency band 
f^, (ta)i/p is a map of an annulus corresponding to a given 
angular bin a, which has been convolved with the beam 
and scaled by the frequency dependence of the SZ effect. 



j/'p' is the noise covariance matrix (which is taken 



to be diagonal in pixel space and v, i.e., iVpix,jyp,i/'j: 
alpS^^^Spp'), and C, 



C^l^'ylbij'iyim,pyim,-, 



is 



the signal covariance matrix of CMB convolved with the 
beam (C/ and b^i are the CMB power spectrum and 
the beam transfer function, respectively). A matrix Fa/^ 

gives the la error of pa as {F~^)aa, and is given by 



h 



(68) 



For di,p, we use the foreground-cleaned V- and W- 
band temperature maps at the HEALPix resolution of 
Nside = 1024, masked by the KQ75y7 mask. Note that 
the KQ75y7 mask includes the 7-year source mask, which 
removes a potential bias in the reconstructed profile due 
to any sources which are bright enough to be resolved 
by WMAP, as well as the sources found by other sur- 
veys. Specifically, the 7-year point source m ask includes 
sourc es in the 7- year WMAP sour c e cata log ([Gold et al.l 
l2Ol0h : sources from IStickel et al.l (Il99l: sources with 
22 GHz fluxes > 0.5 Jy fr om iHirabavashi et al. (2000); 
flat spectrum objects from iTerasranta et al.T2001.'): and 
sou rces from the blazar survev of lPerlman et al.l (|1998[ ) 
and lLandt et al.l (|2001[ ). 

In Figure 1141 we show the measured angular radial 
profiles of Coma in 16 angular bins (separated by — 
20 arcmin), in units of the Rayleigh-Jeans temperature, 
for the V- and W-band data, as well as for the V-l-W 
combined data. The error bar at a given angular bin is 



F. 



al3 (^/9),.'p'[^pix + C]^,p, 



i^pdi^p, 



(67) given by y/{F^c 
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We find that all of these measurements agree well at 
9 > 130 arcmin; however, at smaller angular scales, 
9 < 110 arcmin, the V+W result shows less SZ than both 
the V- and W-only results. Does this make sense? As de- 
scribed in Appendix [Cl our optimal estimator uses both 
the C~^-weighted V-(-W map and the A^^^-weighted 
V— W map. While the latter map vanishes for CMB, 
it does not vanish for the SZ effect. Therefore, the latter 
map can be used to separate CMB and SZ effectively 

This explains why the V-l-W result and the other 
results are different only at small angular scales: at 
9 > 130 arcmin, the measured signal is |AT| < 50 /iK. 
If this was due to SZ, the difference map, V— W, would 
give |AT| < (1 - 1.56/1.81) x 50 /xK ~ 7 /xK, which is 
smaller than the noise level in the difference map, and 
thus would not show up. In other words, our estimator 
cannot distinguish between CMB and SZ at 61 > 130 ar- 
cmin. 

On the other hand, at ^ < 110 arcmin, each of the 
V- and W-band data shows much bigger signals, |AT| > 
100 /iK. If this was due to SZ, the difference map would 
give |Ar| > 14 ^K, which is comparable to or greater 
than the noise level in the difference map, and thus would 
be visible. We find that the difference map does not 
detect signals in 50 < < 110 arcmin, which suggests 
that the measured signal, —100 /xK, is not due to SZ, 
but due to CMB. As a result, the V-l-W result shows less 
SZ than the V- and W-only results. 

In order to quantify a statistical significance of detec- 
tion and interpret the resul t, we model the SZ profile 
using a spherical /3 model (jCavaliere fc Fusco-Femianol 
[19761) : 

ATszi9) - ATsz(O) [l + {9/9,f] ^'-'"^Z' . (69) 

To make our analysis consistent with previous measure- 
ments described later, we fix the core radius, 9c, and the 
slope parameter, /?, at 9c — 10.5 arcmin and /3 — 0.75 
(|Briel et al.lll992l) . and vary only the central decrement, 
ATsz(O). In this case, the optimal estimator is 



Arsz(O) = -Vp'I^Pix 



(70) 



where t^p is a map of the above /3 model with ATgz(O) = 
1, and 

^ = W[^pix + ^];,;,,,pt.p, (71) 



gives the la error as l/vF. 
For V-l-W, we find 

ATsz,Rj(0) = -377 ± 105 ^iK (68% CL), 

which is a 3.6cr measurement of the SZ effect toward 
Coma. In terms of the Compton y-parameter at the cen- 
ter, we find 



ywMAp{0) = ■ 



1 ATsz,Rj(0) 



cmb 



: (6.9 ± 1.9) X IQ- 



CL). 



Let us comp are this measureme nt with the previous 
measurements . iHerbig et all ()1995D used the 5.5-m tele- 
scope at the Owens Valley Radio Observatory (OVRO) 
to observe Coma at 32 GHz. Using the same 6c and f3 as 
above, they found the central decrement of Arsz,Rj(0) = 



-505 ±92 /iK (68% CL), after subtracting 38 /iK due to 
point sources (5C4.81 and 5C4.85). These sources have 
been masked by our point-source mask, and thus we do 
not need to correct for point sources. 

While our estimate o f ATsz,rj(0) is different from that 
of IHerbig et al.l ()1995[ ) only by 1. 2 a, and thus issta- 
tistically consistent, we note that IHerbig et al.l ()1995f ) 
did not correct for the CMB fluctuation in the direc- 
tion of Coma. As the above results indicate that the 
CMB fluctuation in the direction of Coma is on the or- 
der of —100 /iK, it is plausible that the OVRO measure- 
ment implies Arsz.Rj(O) ^ — 400K, which is an excellent 
agreement with the WMAP measurement. 

The Coma cluster has been observed also by the Mil- 
limetre and Infrared Testagrigia Observatory (MITO) 
experiment (iDe Petris et al. 20Q 2.) . U sing the same 9c 
and l3 as above. iBattistelh et°aLl^003l ) found Arsz(O) = 
-184 ± 39, -32 ± 79, and +172 ± 36 /zK (68% CL) at 
143, 214, and 272 GHz, respectively, in units of ther- 
modynamic temperatures. As MITO has 3 frequencies, 
they were able to separate SZ, CMB, and the atmo- 
spheric fluctuation. By fitting these 3 data points to 
the SZ spectrum, Arsz/Tcmb = g,yy, we find ?/mito(0) = 
(6.8 ± 1.0 ± 0.7) X 10~^, which is an excellent agreement 
with the WMAP measurement. The first error is sta- 
tistical and the second error is systematic due to 10% 
calibration error of MITO. The calibra tion error of the 
WMAP data fO.2%: [Jarosik et al.|[2010l ) is negligible. 

Finally, one may try to fit the multi-wavelength data 
of Tsz(O) to separate the SZ effect and CMB. For this 
purpose, we fit the WMAP data in V- and W-band to 
the /3 model without correcting for the CMB fluctuation. 
We find -381 ± 126 /iK and -523 ± 127 /iK in ther- 
modynamic units (68% CL). Th e OVRO mea surement, 
Jsz,Rj(0) = -505 ±92 fiK (Hcr big et al.lll995D . has been 
scaled to the Rayleigh- Jeans temperature with the SZ 
spectral dependence correction, and thus we use this 
measurement at = 0. Fitting the WMAP and OVRO 
data to the SZ effect plus CMB, and the MITO data 
only to the SZ effect (because the CMB was already re- 
moved from MITO using their multi-band data), we find 
y(0) = (6.8 ±1.0) X 10-5 and Arcmb(O) = -136 ±82 /iK 
(68% CL). This result is consistent with our interpre- 
tation that the y-parameter of the center of Coma is 
7 X 10~5 and the CMB fluctuation is on the order of 
-100 /iK. 

The analysis presented here shows that our optimal 
estimator is an excellent tool for extracting the SZ effect 
from multi-frequency data. 

7.3. Nearby Clusters: Vikhlinin et al. 's low-z sample 

The Coma cluster is the brightest SZ cluster on the 
sky. There are other clusters that are bright enough to 
be seen by WMAP. 

7.3.1. Sample of nearby (z < Q.l) clusters 

In order to select candidate s, we use the sample of 
49 nearby clusters compiled bv I Vikhlinin et al. (2009a), 
which are used by th e cosmological analysis given in 
iVikhlinin et al.l ()2009bf) . These clusters are selected from 
the ROSAT All-sky Survey, and have detailed follow-up 
observations by Chandra. The latter property is espe- 
cially important, as it allows us to directly compare the 
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Arnaud et al. - — X-ray data (Vikhlinin et al.) 
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Fig. 15. — Angular radial profiles of the SZ effect toward nearby massive clusters (with M500 > 4 X lO^** Mq and 2 < 0.09), in units 
of the Rayleigh- Jeans (RJ) temperature (/iK). The V- and W-band data are combined optimally to separate the CMB and the SZ effect. 
All of these clusters have 6500 > 14', i.e., resolved by the WMAP beam. The masses, M500, are My given in the 6th column of Table 2 in 
[Vikhlinin et al.. ^2009a), times /ivikhlinin = 0.72 used by them, except for Coma. For Coma, we estimate M500 using the mass-temperature 
relation given in (Vikhlinin ct al. (2009a) with the temperature of 8.45 keV (Wik et al. 2009). The dashed lines show the expected SZ 
effect from the X-ray data on the individual clusters, whereas the solid lines show the prediction from the average pressure profile found by 
lArnaud et al.l II2010I) . Note that Coma is not included in the sample of Vikhlinin ct al. (2009a), and thus the X-ray data are not shown. We 
find that Arnaud et al.'s profiles overpredict the gas pressure (hence the SZ effect) of non-cooling flow clusters. Note that all cooling-flow 
clusters are "re laxed," and all non-cooling-flow clusters are "non-relaxed" (i.e., morphologically disturbed), according to the criterion of 
IVikhlinin eTall ((2009a). 



measured SZ effect in the WMAP data and the expected 
one from the X-ray data on a cluster- by- cluster basis, 
without relying on any scali ng relations 

Not all nearby clusters in IVikhlinin et all ()2009a|) are 
suitable for our purpose, as some clusters are too small 
to be resolved by the WMAP beam. We thus select the 
clusters that have the radius greater than 14' on the sky: 
specifically, we use the clusters whose ^590 = r^Qo/DAiz) 
is greater than 14'. Here, r5oo is the radius within which 



For this reason, the anal ysis given in this section is "cleaner" 
than the one given in Section 17.41 which uses a larger number of 
clusters but relies on scaling relations. Nevertheless, the res ults 
obtained from the analysis in this section and those in Section 17.41 
are in good agreement. 



the mean overdensity is 500 times the critical density 
of the universe. We find that 38 clusters satisfy this 
condition. (Note that the Coma cluster is not included 
in this sample.) 

Of these, 5 clusters have M500 > 6 x 10" h-^ Mq, 
7 clusters have 4 x lO" ft."^ Mq < M500 < 6 x 
IQi^ h-^ Mq, 13 clusters have 2x10" h''^ Mq < M500 < 
4x 10" /i^i Mq, and 13 clusters have 1 x lO" /i"^ Mq < 
M500 < 2 X 10" h-^ Mq. Here, M500 is the mass en- 
closed within r5oo, i.e., M500 = M{r < rsoo). 

Finally, we remove the clusters that lie within the 
KQ75y7 mask (including the diffuse and the source 
mask), leaving 29 clusters for our analysis. (1 cluster 
(A478) in 4 x 10" h-^ Mq < M500 < 6 x lO" Mq, 
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TABLE 12 

Best-fitting Amplitude for the SZ Effect in the WMAP 

7-YEAR DATA 



Mass Range 


# of clusters Vikhlinin et al.^ 


Arnaud ct al. 


6 < M500 < 9 


5 


0.90 ± 0.16 


0.73 ± 0.13 


4 < M500 < 6 


6 


0.73 ± 0.21 


0.60 ± 0.17 


2 < M500 < 4 


9 


0.71 ± 0.31 


0.53 ± 0.25 


1 < M500 < 2 


9 


-0.15 ± 0.55 


-0.12 ± 0.47 


4 < Msoo < 9 


11 


0.84 ± 0.13 


0.68 ± 0.10 


1 < Maoo < 4 


18 


0.50 ± 0.27 


0.39 ± 0.22 


4 < M500 < 9 








cooling flow^ 


5 


1.06 ± 0.18 


0.89 ± 0.15 


non-cooling flow^ 


6 


0.61 ± 0.18 


0.48 ± 0.15 


2 < M500 < 9 


20 


0.82 ± 0.12 


0.660 ± 0.095 


1 < M500 < 9 


29 


0.78 ± 0.12 


0.629 ± 0.094 



^ In units of 10 h Mq. Coma is not included. The masses 
are derived from the mass-Yx relation, and are given in the 6th 
column of Table 2 in lVikhl mm et al.l I l2009al) . times /ivikhlinin — 
0.72. 

^ Derived from the X-ray data on the individual clusters 
JVikhnnin"c t al. 2009a). 

^ The "universal pressure profile" given bv lArnaud et all l| 2010f) . 
^ Defi nition of "cooling flow" follows that of ^ikhlinin ct al.| 
]2007D . All of cooling-fiow clusters here are also "relaxed,'' ac- 
cording to the criterion of iVikhlinin et al.| J2009a[). 
^ Defi nition of "non-cooling flow" follows that of [Vikhlinin et all 
]2007D . All of non-cooling-flow clusters here are also "non- 
relaxed" (or mer gers or morphologically disturbed), according 
to the criterion of IVikhlinin et all | |2009aD . 

4 clusters in 2x lO" ft."^ M© < M500 < 4x 10" h-^ Mq, 
and 4 clusters in 1 x 10" M© < M500 < 2 x 
10"'^''' Mq are masked, mostly by the point source 
mask.) The highest redshift of this sample is z = 0.0904 
(A2142). 

7.3.2. WMAP versus X-ray: cluster-by-cluster comparison 

In Figure [T5l we show the measured SZ effect in the 
symbols with error bars, as well as the expected SZ from 
the X-ray data in the dashed lines. 

To compute the expected SZ, we use equation (p5|) with 
Pe ~ neksTe, where Ue and Tg are fits to the X-ray 
data. Specifically, we use (see equation (3) and (8) of 
iVikhhnin et"aL 20060 



nlir)-- 
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1 
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(73) 



where x = r/r^oQ. The parameters in the above equa- 
tions are found from the Chandra X-ray data, and kindly 
made available to us by A. Vikhlinin. 

For a given pressure profile, Pe{f)i we compute the SZ 
temperature profile as 



ATszie)=g.T,^b^Pfie) 



273 tiK 



25 eV cm-3 Mpc 



(74) 



With a typo in equation (8) corrected (A. Viklilinin 2010, 
private communication) . 



where P^'^{6) is the projected electron pressure profile on 
the sky: 



Pl\0) = 



dl R 



(75) 



Here, we truncate the pressure profile at Tout- We take 
this to be rout = Srsoo. While the choice of the boundary 
is somewhat arbitrary, the results are not sensitive to 
the exact value because the pressure profile declines fast 
enough. 

We find a good agreement between the measured and 
expected SZ signals (see Figure [15]), except for A754: 
A754 is a merging cluster with a highly disturbed X-ray 
morphology, and thus the expected SZ profile, which is 
derived assuming spherical symmetry (equation l74|) . may 
be different from the observed one. 

To make the comparison quantitative, we select clus- 
ters within a given mass bin, and fit the expected SZ 
profiles to the WMAP data with a single amplitude, a, 
treated as a free parameter. The optimal estimator for 
the normalization of pressure, a, is 



i.'p'[^pix + q;v. 



(76) 



where t,yp is a map containing the predicted SZ profiles 

around clusters, and the la error is where F is 

given by equation ([7T|) . 

We summarize the results in the second column of Ta- 
ble [121 We find that the amplitudes of all mass bins 
are consistent with unity (a — 1) to within 2a (except 
for the "non-cooling flow" case, for which a is less than 
unity at 2.2cr; we shall come back to this important point 
in the next section). The agreement is especially good 
for the highest mass bin (M500 > 6 x 10^"* h^^ Mq), 
a = 0.90 ±0.16 (68% CL). 

Note that this is a 5.6a detection of the SZ effect, 
just from stacking 5 clusters. By stacking 11 clusters 
with M500 > 4 X 10" AIq (i.e., all clusters in 

Figure[nibut Coma), we find a = 0.84±0.13 (68% CL), 
a 6.5(7 detection. In other words, one does not need 
to stack many tens or hundreds of clusters to see 
the SZ effect in the WMAP data, cont rary to what 
is cor nmonly done in the literature (iFosalba et aTl 



2003t i Hernandez-Mont eagudo k, Rubiho-Martfn 
Hernan dez-Monteagudo et al. 20041: Myers et al 



2004; 



2004: 



Afshordi et al. 2005; Lieu et al. 2006; Bielbv & Shankd 
2007; Afshordi et al. 2007 : Atrio-Barandela et aLl 
2008; Kashlinskv et^ [20(381 : iDiego fc Partridge! I2009r 
iMelin et al.. .20101 

From this study, we conclude that the WMAP data 
and the expectation from the X-ray data are in good 
agreement. 



7.3.3. 



WMAP versus a "universal pressure profile'' 
Arnaud et al. : effect of recent mergers 



of 



Recently. I Arnaud et al.l ()2010l ) derived pressure profiles 
of 33 clusters from the X-ray follow-up observations of 
the REXCESS clusters using XMM-Newton. The REX- 
CESS sample co ntains clusters selected from the ROSAT 
All-sky Survey ([Bohringer et al.|[2007D . By scaling the 
pressure profiles appropriately by mass and redshift and 
taking the median of the scaled profiles, they produced 
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a "universal pressure profile." We describe this profile in 
Appendix ID.ll 

We show the predicted ATsz{d) from Arnaud et al.'s 
pressure profile in Figure [15] (solid lines) . In order to 
compute their profile, we need the mass of clusters, 
M500- We take Mnnn f r om th e the 6th column of Ta- 
ble 2 in lVikhlinin et all ()2009a[ ). which are derived from 
the so-called mass- Fx relation, the most precise mass 
proxy known to date with a scatter of about 5%F^ 
Again, we take the outer boundary of the pressure to 
be Tout = 6r5oo. 

We fit Arnaud et al's profiles to the WMAP data of 
29 clusters. We find that, in all but one of the mass 
bins, the best-fitting normalization, a, is less than unity 
by more than 2a. By stacking 11 clusters with Mqqq > 
4x 10" h-^ Mq, we find a = 0.68±0.10 (68% CL). This 
measurement rules out a = 1 by 3.2cr. The universal 
pressure profile overestimates the SZ effect by ~ 30%. 

What causes the discrepancy? The thermal/dynamical 
state of gas in clusters may be the culprit. From Fig- 
ure [151 we find that the X-ray data (hence the SZ effect) 
and the universal profile agree well for "cooling fiow" 
clusters, but do not agree for non-cooling flow clusters. 

The cooling flow clusters have cool cores, in which the 
cooling time (due to Bremsstra hlung) is shorter than 
the Hubble time ([Fabianl I1994D . The clusters shown 
in Figure [T5| are classified as either "cooling fiow" or 
"non-cooling flow" cl usters, following the definition of 
IVikhlinin et all (12001 . 

We find that Arnaud et al.'s profiles agree with the X- 
ray data on the individual c lusters well at 9 > O .305oo. 
This agrees with Figure 8 of I Arnaud et~all (|20m ). The 
profiles differ significantly in the inner parts of clusters, 
which is also i n goo d agreement with the conclusion of 
I Arnaud et al.l (|2010f ): they find that cool-core clusters 
show much steeper inner profiles than non-cool-core clus- 
ters (their Figure 2 and 5). 

For cooling-fiow clusters, the agreement between the 
WMAP data and Arnaud et al.'s profile is good: a = 
0.89±0.15 (68% CL). However, for non-cooling-fiow clus- 
ters, we find a very low amplitude, a = 0.48 ± 0.15 
(68% CL), which rules out Arnaud et al.'s profile by 3.5cr. 
A similar trend is also observed for the individual X-ray 
data of Vikhlinin et al.: a = 1.06 ± 0.18 and 0.61 ± 0.18 
(68% CL) for cooling-flow and non-cooling-flow clusters, 
respectively; however, statistical significance is not large 
enough to exclude a = 1. 

Based on this study, we conclude that one must distin- 
guish between cool-core (cooling flow) and non-cool-core 
clusters when interpreting the observed profile of the SZ 
effect. It is clear (at the 3.2cr level) that Arnaud et al.'s 
profile is inconsistent with the individual X-ray data and 
the SZ data taken by WMAP, and (at the 3.5a level) one 
must distinguish between the cool-core and non-cool-core 
clusters. 

Interestingly, all of cooling-flow clusters are "relaxed" 
clusters, and all of non-cooling-flow clusters are "non- 

The except ion is Coma, which is not included in the 
nearby sample of IVikhlinin et al.l Il2009a|). Ther efore, wc use 
the mass-temperature relation of IVikhlinin et al.l ([2009a) (the 
first row of Table 3) for this cluster: M500 = (3.02 ± 0.11) X 
lO^-* /i-i Mq{Tx/5 keV)i'53±0-''8/£;(^), with E{z) = 1.01 for 
Coma's redshift, z = 0.023. We use the X-ray temperature of 
Tx = 8.45 ± 0.06 kcV (,Wik et al 2009f l. 



relaxed" (i.e., morphologic ally disturbed) c lusters , ac- 
cording to the criterion of IVikhlinin et al.l (|2009aD . If 
we interpret this as non-cooling-flow clusters having un- 
dergone recent mergers, then we may conclude that we 
are flnding the effect of mergers on the SZ effect. 

While our conclusion is still based on a limited number 
of clusters, it may be valid for a much larger sample of 
clusters, as we shall show in Section FT. 5. 41 

Finally, we note that the current generation of hydro- 
dynamical simulations predict the pressure proflles that 
ar e even steeper than A rnard et al.'s profile (see Figure 7 
of lArnaud et al.ll2010( ). Therefore, the simulations also 
overpredict the amount of pressure in clusters relative to 
the WMAP data. We shall come back to this point in 
Section [7.5.51 

7.4. Statistical Detection of the SZ Effect 

To explore the SZ effect in a large number of clusters, 
we use a galaxy cluster catalog consisting of the fJOSAT- 
ESO flux-limited X-ray (REFLEX) galaxy cluster survey 
(jBohringer et al.ll2004f ) in the southern hemisphere above 
the Galactic plane {6 < 2.5° and |6| > 20°) and t he ex- 
tended Bri ghtest Cluster Sample (eBCS; Ebelin g~et al.l 
119981 12000( ) in the northern hemisphere above the Galac- 
tic plane {6 > 0° and |&| > 20°). Some clusters are 
contained in both samples. Eliminating the overlap, this 
catalog contains 742 clusters of galaxies. Of these, 400, 
228, and 114 clusters lie in the redshift ranges of 2; < 0.1, 
0.1 < z < 0.2, and 0.2 < 2 < 0.45, respectively. 

We use the foreground-reduced V- and W-band maps 
at the HEALPix resolution of iVgidc — 1024, masked by 
the KQ75y7 mask, which eliminates the entire Virgo clus- 
ter. Note that this mask also includes the point-source 
mask, which masks sources at the locations of some clus- 
ters (such as Coma). After applying the mask, we have 
361, 214, and 109 clusters in z < 0.1, 0.1 < z < 0.2, and 
0.2 < z < 0.45, respectively. 

We again use equation (157]) to find the angular radial 
profile in four angular bins. For this analysis, {ta)up 
is a map containing many annuli (one annulus around 
each cluster) corresponding to a given angular bin a, 
convolved with the beam and scaled by the frequency 
dependence of the SZ effect. 

We show the measured profile in the top panel of Fig- 
ure [T6| We have done this analysis using 3 different 
choices of the maximum redshift, Zmax, to select clus- 
ters: Zmax = 0.1, 0.2, and 0.45. We find that the results 
are not sensitive to Zmax- As expected, the results for 
2max = 0.1 have the largest error bars. The error bars 
for Zmax = 0.2 and 0.45 are similar, indicating that we do 
not gain much more information from z > 0.2. The er- 
ror bars have contributions from instrumental noise and 
CMB fluctuations. The latter contribution correlates the 
errors at different angular bins. 

The top panel shows a decrement of —3.6 ± 1.4 fiK at 
a very large angular distance from the center, ^ = 63 ar- 
cmin, for Zmax = 0.2. As we do not expect to have 
such an extended gas distribution around clusters, one 
may wonder if this result implies that we have a bias in 
the zero level. In order to check for a potential system- 
atic bias, we perform the following null test: instead of 
measuring the SZ signals from the locations of clusters, 
we measure them from random locations in the WMAP 
data. In the middle panel of Figure [THJ we show that our 
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Fig. 16. — Average temperature profile of the SZ effect from tfie 
stacking analysis, in units of tfie Rayleigh-Jeans (RJ) temperature 
(^K), at 9 = 7, 35, 63, and 91 arcmin. The V- and W-band data 
are combined using the optimal estimator. (Top) The SZ effect 
measured from the locations of clusters of galaxies. The results 
with three different maximum redshifts, 2;max = 0.1 (blue; left), 
0.2 (green; middle), and 0.45 (red; right), are shown. The error 
bars include noise du e to the CMB fluctuation, and thus are cor- 
related (see equation II77I I for the correlation matrix). (Middle) A 
null test showing profiles measured from random locations on the 
sky (for Zmax = 0.2; the number of random locations is the same 
as the number of clusters used in the top panel). Three random 
realizations are shown. Our method does not produce biased re- 
sults. (Bottom) The measured profile (zmax = 0.2) is compared 
with the model profiles derived from the median of 33 clusters 
in the REXCESS sample (Arnaud et al. 2010) and theoretically 
calculated from hydrostatic equilibrium (Komatsu & Scljak 2001) 
with two different concentration parameters. Note that the model 
profiles are calculated also for Zmax = 0.2 but have not been mul- 
tiplied by the best-fitting normalization factors given in Table [T3l 
The theoretical profiles are processe d in the same manner that the 
data are processed, using equation l|67|l . 



method passes a null test. We find that the measured 
profiles are consistent with zero; thus, our method does 
not introduce a bias. 

Is this signal at a degree scale real? For example, 
are there nearby massive clusters (such as Coma) which 
give a significant SZ effect at a degree scale? While the 
Virgo cluster has the largest angular size on the sky, 
the KQ75y7 mask eliminates Virgo. In order to see if 
other nearby clusters give significant contributions, we 
remove all clusters a.t z < 0.03 (where there are 57 clus- 
ters) and remeasure the SZ profile. We find that the 
changes are small, less than 1 /iK at all angular bins. At 
9 = 63 arcmin, the change is especially small, ~ 0.1 /iK, 
and thus nearby clusters do not make much contribution 
to this bin. 

The apparent decrement at = 63 arcmin is probably 
due to a statistical fiuctuation. The angular bins are 
correlated with the following correlation matrix: 



/ 1 0.5242 0.0552 0.0446 \ 

0.5242 1 0.4170 0.0638 

0.0552 0.4170 1 0.4402 

V 0.0446 0.0638 0.4402 1 / 



(77) 



where the columns correspond to 9 = 7, 35, 63, and 
91 arcmin, respectively. The decrements at the first two 
bins (at 9 = 7 and 35 arcmin) can drive the third bin at 
9 = 63 arcmin to be more negative. Note also that one 
of the realizations shown in the bottom panel ( "Random 
1" in the middle panel of Figure fTH]) shows ~ —3.5 /iK 
at = 63 arcmin. The second bin is also negative with 
a similar amplitude. On the other hand, "Random 2" 
shows both positive temperatures at the second and third 
bins, which is also consistent with a positive correlation 
between these bins. 

Finally, in the bottom panel of Figure 1161 compare 
the measured SZ profile with the expected profiles from 
various cluster gas models (described in Section 17. 5p . 
None of them show a significant signal at = 63 arcmin, 
which is also consistent with our interpretation that it is 
a statistical fluctuation. 

7.5. Interpretations 
7.5.1. General idea 

In order to interpret the measured SZ profile, we need 
a model for the electron pressure profile, Pe{i') (see equa- 
tion ([65l) ). For fully ionized gas, the electron pressure is 
related to the gas (baryonic) pressure, Pgas(?'), by 



Pe{r) 



2 + 2X 



(78) 



where X is the abundance of hydrogen in clusters. For 
X = 0.76, one finds Pe{r) = 0.518Pgas(r). 

We explore three possibilities: (i) Arnaud et al.'s pro- 
file that we have used in Section [731 (ii) theoretical pro- 
files derived by assuming that the gas pressure is in hy- 
drostatic equilibrium with gravitat ional potential given 
by an Navarro-Frenk- White (NFW; Navarro e t"aIlll997D 
mass density profile (Komatsu & Seljak 2001J, and (iii) 
theoretical profiles from hydrodynamical simulations of 
clusters of gala xies with and w ithout gas cooling and star 
formation (Na gai et aIll2007D . 

The case (ii) is relevant because this profile is 
used in the calculation of the SZ power spectrum 
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(|Komatsu fc Selia^l2002[ ) that has been used as a tem- 
plate to marginalize over in the cosmolo gical parame- 
ter estimation since the 3-year analysis (iSpergel et al.l 
[20071 iDmiklev et al.l[200l iLarson et al.ll2010t ). Analvt- 
ical models and hydrodynamical simulations for the SZ 
signal are also the basis for planned efforts to use the SZ 
signal to constrain cosmological models. 

As we have shown in the previous section using 29 
nearby clusters, Arnaud et al.'s pressure profile overpre- 
dicts the SZ effect in the WMAP data by - 30%. An 
interesting question is whether this trend extends to a 
larger number of clusters. 

7.5.2. Komatsu-Seljak profile 

The normalization of the KS profile has been fixed by 
assuming that the gas density at the virial radius is equal 
to the cosmic mean baryon fraction, rih/^m, times the 
total mass density at the virial radius. This is an upper 
limit: for example, star formation turns gas into stars, 
reducing the amount of gas. KS also assumes that the 
gas is virialized and in thermal equilibrium (i.e., elec- 
trons and protons share the same temperature) every- 
where in a cluster, that virialization converts potential 
energy of the cluster into thermal energy only, and that 
the pressure contributed by bulk flows, cosmic rays, and 
magnetic fields are unimportant. 

We give details of the gas pressure profiles in Ap- 
pendix |D] In the top left panel of Figure [ITl we sh ow 
Arnaud et al.'s pressure profiles (see Appendix ID.lJj in 



the solid lines, and the KS profiles (see Appendix ID.2P 
in the dotted and dashed lines. One of the inputs for the 
KS profile is the so-called concentration parameter of the 
NFW profile. The dotted line is for the concentration pa- 
rameter of c = 10(Mvir/3.42 X 10^2 /t -i Mr^)-°-^/(l + z) 
(MiaS '2003), which was used by iKomatsu fc SeliaM 
(2002) for their calculation of the SZ power spectrum. 
Here, Afvir is the virial mass, i.e., mass enclosed within 
the virial radius. The dashed line is for c = 7.85(Mvir/2 x 
10^2 h-i Mq)-0 08V(1 + z)0-7\ which was found from 
recent N-body simulat ions with the WM AP 5-year cos- 
mological parameters (jPuffv et al.ll2008[ ). 

We find that the KS profiles and Arnaud et al.'s profiles 
generally agree. The agreement is quite good especially 
for the KS prof ile with the concentration parameter of 
iDuffv et al.l (|2008i) . The KS profiles tend to overestimate 
the gas pressure relative to Arnaud et al.'s one for low- 
mass clusters {Mq < 10^'' Mq). Can we explain this 
trend by a smaller gas mass fraction in clusters than the 
cosmic mean? To answer this, we compute the gas mass 
fraction by integrating the gas density profile: 



in J^"-"" r^dr Pgas(r) 



gas 



M. 



500 



M. 



500 



(79) 



where M^oo arid Mgas,5oo are the total mass and gas mass 
contained within t^qq, respectively. 

In Figure [ 18[ we sh ow /gas from X-ray observations 
(fVikhhnin'^tral.l l2009aD : 

/gas(V0-72)3/2 ^ 0.125+0.037 logio(Af5oo/10i^ h-^ Mq), 

(80) 

for h — 0.7, and /gas fro m the KS profi les w ith the con- 
centration parameters of iSeliakI (|2OOO0 and IDuffv et al.l 
((2008,). We find that the KS predictions, /gas ^ 0.12, 



are always much smaller than the cosmic mean baryon 
fraction, flf,/rim = 0.167, and are nearly independent of 
mass. A slight dependence on mass is due to the depen- 
dence of the concentration parameters on mass. While 
the KS profile is normalized such that the gas density at 
the virial radius is flb/flm times the total mass density, 
the gas mass within rspo is much smaller than flb/flm 
times M500, as the gas density and total matter density 
profiles are very different near the center: while the gas 
density profile has a constant-density core, the total mat- 
ter density, which is dominated by dark matter, increases 
as pm (X 1/r near the center. 

However, the behavior of /gas measured from X-ray 
observations is very different. It has a much steeper 
dependence on mass than predicted by KS. The rea- 
son for such a steep dependence on mass is not yet un- 
derstood. It could be due to star formation occurring 
more effectively in lower mass clusters. In any case, for 
^500 = 3 X 10^^ Mq, the observed gas mass fraction 
is /gas — 0.11, which is only 10% smaller than the KS 
value, 0.12. For M^oa = 3 x 10^^ h^^ Mq, the observed 
gas mass fraction, 0.08, is about 30% smaller than the 
KS value. This is consistent with the difference between 
the KS and Arnaud et al.'s pressure profiles that we see 
in Figure [T71 thus, once the observed mass dependence 
of /gas is taken into account, these profiles agree well. 

To calibrate the amplitude of gas pressure, we shall use 
the KS pressure profile (without any modification to /gas) 
as a template, and find its normalization, a, from the 
WMAP data using the estimator given in equation (175)) . 
We shall present the results for hydrodynamical simula- 
tions later. 

For a given gas pressure profile, Pgasir), we com- 
pute the electron pressure as Pg = 0.518Pgas (see equa- 
tion (|78l)). We then use equation ([74]) to calculate 
the expected SZ profile, Argz(6'). We take the outer 
boundary of the pressure to be 3 times the virial radius. 
To ut = 3rvir, which is the s ame as the parameter used 
by Komats u fc Selia3 ()2002|) . In the right panels of Fig- 
ure [iTl we show the predicted ATsz {&) , which will be 
used as templates, i.e., t,^p. 

7.5.3. Lumtnosity-size relation 

Now, in order to compute the expected pressure pro- 
files from each cluster in the catalog, we need to know 
''500 • We calculate r5oo from the observed X-ray lumi- 
nosity in ROSATs 0.1-2.4 keV band, Lx, as 



f500 = 



(0.753 ±0.063) h^^ Mpc 



Eiz) 
Lx 



1044 /i-2 erg s-i 



0.228±0.015 



(81) 



where E{z) = H{z)/Ho = + z)^ + VLkY'"^ for a 

ACDM model. This is an empirical rel ation found from 
X-ray observations (see equation (2) of IBohringer et all 
2007*) based upon the temperature-Lx relation from 
Ikebe et al] (I2002D and t he ''500-temperature relation 
from Arnaud et all (pOOl . The error bars have been 
calculated by propagating the errors in the temperature- 
Lx and rsoo-temperature relations. Admittedly, there 
is a significant scatter around this relation, which is the 
most dominant source of systematic error in this type of 
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Fig. 17. — Gas pressure profiles of clusters of galaxies, Pgas(»'), at z = 0.1, and the projected profiles of the SZ effect, ATgz(9) 
(Rayleigh- Jeans temperature in /iK). (Top Left) The gas pressure profiles. The upper and bottom set of curves show M500 = 3 X 10^* and 
3 X 10^^ A<f0, respectively. The horizontal axis shows radii scaled by the corresponding rsoo = 0.78 and 0.36 Mpc, respectively. 
The solid lines show Pgas{r) = Pe(»')/0.518 derived from X-ray observ ations (Arnaud ct al. 2010), while the dotted and dashed lines 
show P gasC?*) predicted from hydrostatic equilibrium IIKomatsu fc Seliakll2001i ) with NFW concentration parameters of ISeliafl II2000I ) and 
IDuffv e t al. (2008), respectively. (Top Right) The projected SZ profiles computed from the corresponding curves in the top left panel and 
equation I I74II . The horizontal axis shows angular radii scaled by S500 = ''soo/^'Ai which is 10 and 4.7 arcmin for M500 = 3 X 10^** and 
3 X 10^^ A^0, respectively. (Bottom Left) Same as the top lef t panel, but t he do tted and dashed lines show Pgas(?') predicted from 
"Cooling-|-Star Formation" and "Non-radiative" simulation runs bv lNaeai etaLl HWOTh . (Bottom Right) Same a s th e top right panel, but 
the dotted and dashed lines are computed from the corresponding curves in the bottom left panel and equation Il74|l . 
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Fig. 18. — Gas mass fraction as a function of Msoo- The 
thick horizontal hne shows the cosmic mean baryon fraction, 
fis/Hm = 0.167. The solid line shows the gas mass fraction, fgaa 

' /M500, derived from X-ray observations (Vikhli nin et al.l 

1200 9a') , while the dotted and dashed lines show /f^as predicted from 
hydrostatic equilibrium (Komatsu & Seliak 2001) with NFW con- 
centration parameters of lSeFiaSl (120001 ) and lDuffv et al.l ( 120081 ). re- 
spectively. 
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Fig. 19. — Distribution of M500 estimated from clusters in the 
catalog using the me asu red X -ray luminosities in 0.1-2.4 keV band, 
Lx, and equations l|81|l and (|D2|| . The light blue, dark blue, and 
pink histograms show Zmax = 0.45, 0.2, and 0.1, respectively. 

analysis. (The results presented in Section 17.31 do not 
suffer from this systematic error, as they do not rely on 
relations.) As M500 oc r|gQ, a ~ 10% error in 
the predicted values of rsoo gives the mass calibration 
error of ~ 30%. Moreover, the SZ effect is given by Afsoo 

times the gas temperature, the latter being proportional 

2/3 

to M5QQ according to the virial theorem. Therefore, the 
total calibration error can be as big as ~ 50%. 

In order to quantify this systematic error, we re- 
peat our analysis for 3 different size-luminosity relations: 
(i) the central values, (ii) the normalization and slope 
shifted up by Icr to 0.816 and 0.243, and (iii) the nor- 
malization and slope shifted down by Icr to 0.690 and 
0.213. We adopt this as an estimate for the systematic 
error in our results due to the size-luminosity calibration 
error. For how this error would affect our conclusions, 
see Section [7771 

Note that this estimate of the systematic error is con- 
servative, as we allowed all clusters to deviate from the 



best-fit scaling relation at once by ±1ct. In reality, the 
nature of this error is random, and thus the actual er- 
ror caused by the sc atter in the scalin g relation would 
probably be smaller. iMelin et al.l (|2010( ) performed such 
an analysis, and found that the systematic error is sub- 
dominant compared to the statistical error. 

Nevertheless, we shall adopt our conservative estimate 
of the systematic error, as the mean scaling relation also 
varies from a uthors to authors. The mean scaling rela- 
tions used bv IMelin et al.l (|2010f) are within the error bar 
of the scaling relation that we use (equation (I5T|) '). 

In Figure [111 we show the distribution of M500 esti- 
mated from clusters in the catalog using the measured 
values of Lx and equations (|8T|) and (|D2|) . The distri- 
bution peaks at M500 ^ 3 x 10^'* M© for Zmax = 0.2 
and 0.45, while it peaks at A/500 ~ 1-5 x lO" hr'^ Mq 
for Zmax = 0.1. 

7.5.4. Results: Arnaud et al. 's profile 

For Arnaud et al.'s pressure profile, we find the best- 
fitting amplitudes of a = 0.64 ± 0.09 and 0.59 ± 0.07 
(68% CL) for Zmax = 0.1 and 0.2, respectively. The for- 
mer result is fully consistent with what we find from the 
nearby clusters in Section[L3l a = 0.63 ±0.09 (68% CL; 
for 1 X lO^" h"^ Mq < M500 < 9 X lO^-* h-^ Mq and 
z < 0.09). 

The significance level of statistical detection of the SZ 
effect is about Sa for z,„ax = 0.2. With the systematic 
error included, we find a = 0.59 ± 0.07^0^3 for z^ax = 
0.2; however, the above agreement may suggest that the 
fiducial scaling relation (equation ([5T|) ) is, in fact, a good 
one. 

As we have shown in Section 17.31 the measured SZ ef- 
fects and the predictions from the X-ray data agree on a 
cluster-by-cluster basis. A plausible explanation for the 
discrepancy between the WMAP data and Arnaud et 
al.'s profile is that Arnaud et al.'s profile does not distin- 
guish between cooling-fiow and non-cooling-flow clusters. 

Nevertheless, this result, which shows that the SZ ef- 
fect seen in the WMAP data is less than the average 
"expectation" from X-ray observations , agrees qualita- 
tively with some of the previous work ([Lieu et al.l [20061 : 
IBielbv fc Sha^ [20071: IDieeo fc Partridge! 120091) ^ The 
other work showed that the SZ effect seen in the WMAP 
data is consistent with expectati ons from X-r ay observa- 
tions (Afshordi et aL 2007; Meli n et al.|[20Tol ). 

These author s used wide l y different methods and clus- 
ter catalogues. [Lieu et al.l l|2006l) were the first to claim 
that the SZ effect seen in the WMAP data is significantly 
less than expected from X-ray data, by using 31 clusters 
compiled by Bonamcnte et al., (2002). Biclbv & Shanki 
(j2007j ) extended the analvs is of [Lieu et al . (2006) by us- 
ing 38 clusters compiled bv lBonamente et al. (2006), for 
which the observational data of the SZ effect from OVRO 
and Berkeley Illinois Maryland Association (BIMA) are 
available. They did not use scaling relations, but used 
a spherical isothermal /3 model to fit the X-ray surface 
brightness profile of each cluster in the catalog, and cal- 
culated the expected SZ signa ls, assumin g that the in- 
tracluster gas is isothermal. [Lieu et all ()2006l ) found 
that the measured signal is smaller th an expected from 
X-ray data by a factor of 3-4, and IBielbv fc Shankj 
(|2007J found a similar result for the cluster catalog of 
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TABLE 13 

Best-fitting Amplitude of Gas Pressure Profile" 



Gas Pressure Profile 



Type 



. 0.2 



High 



Arnaud et al. (20101 
Arnaud et al. (2010) 
Arnaud et al. (2010) 
Arnaud et al. (2010) 
Arnaud et al. (2010) 
Komatsu &: Seliak (20011 
Komatsu &: Seliak (20011 
Nagai et al. (2007 ) 
Nagai et al. (2007 ) 



X-ray Obs. (Fid.)' 
REXCESS scaling' 
intrinsic scaling^ 
rout = 2r5oo* 
rout = r soo'' 
equation iPlGt 
equation ]D17t 
Non-radiative 
Cooling-I-SF 



0.64 ± 0.09 0.59 ± Omt^'^l 0.67 ± 0.09 0.43 ± 0.12 



0.90 ± 0.12 0.55 ± 0.16 
0.84 ± 0.11 0.46 ± 0.13 
0.67 ± 0.09 0.43 ± 0.12 
0.74 ± 0.09 0.44 ± 0.14 



N/A 0.78 ±0.09 

N/A 0.69 ±0.08 

N/A 0.59 ±0.07 

N/A 0.65 ±0.08 

0.59 ± 0.09 0.46 ± 0.06^0 0.49 ± 0.08 0.40 ± 0.11 
0.67 ± 0.09 0.58 ± 0.07tp g 

N/A 

N/A 



0.66 ± 0.09 0.43 ± 0.12 



0.50 ± 0.06^0 ^1 0.60 ± 0.08 0.33 ± 0.10 
0.67 ± 0.08to 23 "■''S ± O-l" "-45 ± 0.14 



" The quoted error bars show 68% CL. The first error is statistical, while the second error is 
systematic. The systematic error is caused by the calibration error in the size-luminosity relation 
(rsoo-^x relation; see equation (I81t and discussion below it). While we quote the systematic error 
in the amplitudes only for jSmax — 0.2, the amplitudes for Zmux — 0.1 also have similar levels of the 
systematic error. Due to a potential contamination from unresolved radio sources, the best-fitting 
amplitudes could also be underestimated by 5 to 10%. This is not included in the systematic error 
budget because it is sub-dominant. See Section l7.7l for discussion on the point source contamination. 
^ "High Lx" uses clusters with 4.5 < Lx/(10 erg s^^) < 45 and z < 0.2. Before masking, there 
are 82 clusters. The quoted errors are statistical. 

*^ "Low Lx" uses clusters with 0.45 < Lx / (10'*'* erg s~ ^ ) < 4.5 and z < 0.2. Before masking, there 
are 417 clusters. Clusters less luminous than these (129 clusters are fainter than 0.45 X 10^^ erg s~*) 
do not yield a statistically significant detection. The quoted errors are statistical. 
^ With the fiducial scaling relation between rsoo and Lx , ^500 — 

0.753 h-i^ipcj^^/(^044 5- 1 )]"■ 228 jBohringer et al.l [2007^ 



For 

corresponds to M500 — 4.1 and 3.9 X 10^^ h 
erg s^^ corresponds to A-/500 — 0.84 and 0.80 X 10 



this scaling rela- 
^ Mq for z = 0.1 

-1 Afo 



for 



0.717 fe~^ Mpc //in-l 



erg s )] 



0.222 



tion, Lx = 4.5 x 10** org s^ 
and 0.2, and Lx = 0.45 x 10* 
z — 0.1 and 0.2, respectively. 

With the "REXCESS" scaling relation, rsoo 

used by IMelin et all ||2010D . For this scaling relation, Lx — 4.5 x 10^^ erg s^^ corresponds to 
M500 = 3.4 and 3.1 X 10^'* Mq for z = 0.1 and 0.2, and Lx = 0.45 X 10"** erg s"^ corresponds 

to M500 = 0.73 and 0.68 X lO" /i"^ Mq for z = 0.1 and 0.2, respectively. The quoted errors are 
statistical. 

^ With the "intrinsic' 

IMelin et all (i20To|) 

and 3.4 x 10^^ h~ 



Mpc 



[Lx/(10^^ h-^ergs-')] 



scaling relation, t^qq ^ —^1.15^^^ 
For this scaling relation, Lx — 4.5 x 10^^ erg s 
Mq for z ^ 0.1 and 0.2, and Lx ^ 0.45 x 

Mq for 2: — 0.1 and 0.2, respectively. 



0.207 



M500 ^ 0.88 and 0.82 x 10^^ 
statistical. 

^ The gas extension is truncated at r^ut — 2r5( 
is used. The quoted errors are statistical. 
^ The gas extension is truncated at Tout — '''500 
used. The quoted errors are statistical. 



used by 

^ corresponds to M500 — 3-7 
10'^'^ ergs"""'" corresponds to 
The quoted errors are 



instead of 6r5oo- The fiducial v^qq-Lx relation 



instead of Gr^oo- The fiducial ^ 



clati( 



iBonamente et a l.l (12000). 

iDiego fc Part ridge (2009) used the same cluster cat- 
alog that we use (REFLEX-f eBCS), but used a differ- 
ent scaling relation: they related the cluster core radius 
to the X-ray luminosity (we relate rsoo to the X-ray lu- 
minosity). They found a large discrep ancy (similar to 
iLieu et al.ll2Q0a iBielbv fc Shanks! l2007l ) when a spheri- 
cal isothermal /3 model was used to predict the SZ sig- 
nal, while they found a smaller discrepancy (similar to 
our results) when m ore realistic gas models were used. 
lAfshordi et alj (|2007[ ) used 193 clusters selected from the 
XBAC catalog. Their catalog consisted of the clusters 
that have measured X-ray temperatures (> 3 keV). They 
then used a scaling relation between r2oo and the X-ray 
temperature. They found that the measured SZ signal 
a nd X-ray data are c onsistent . 

IMelin et all (|2010D used the 5-year WMAP data and 
a bigger sample of 893 clusters and a scaling rela- 
tion betwee n rsoo a nd the X-ray l u minos ity taken from 
iPratt et al.l (|2009l ): lArnaud et all (|20Tot ). They com- 
pared the measured integrated pressure from the WMAP 
data to the expectation from Arnaud et al.'s profile, and 
concluded that they agree very well. (The normaliza- 
tion is consistent with unity within the statistical un- 



certainty.) We find, on the other hand, that the nor- 
malization is significantly less than unity compared to 
the statistical uncertainty. How can we reconcile these 
results? 

One possibility would be the difference in the scaling 
relations. The scaling relation shifted down by Icr would 
make the predicted SZ signals smaller, which would then 
increase the best-fitting amplitude. Given the size of the 



systematic error, a = 0.59 ± 0.07 



inconsistent with the data, 
scaling relations: 



+0.38 
-0.23' 



1 may not be 



Specifically, they used two 



1- = 



2- = 



0.717 hr 



Mpc r 



£1.19(2) [-^50o/(10 

0.745 /.-^ Mpc[^^^^/(io44 



E^-^Hz) 



erg 



erg s 



S-l)]0-222, 
-I^JO.207 



where the relations 1 and 2 corres pond to the "REX- 
CESS" and "intrinsic" relations in IMelin et"an (|2010( ). 
respectively. Here, L500 is the X-ray luminosity mea- 
sured within 7^500, which is calculated from Lx- While 
we do not have the conversion factors they used, a typ- 
ical magnitu de of the co n versio n factors is about 10%, 
according to IMelin et"all ()2010D . A 10% change in Lx 
gives a 2% change in rsoo, which is negligible com- 
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pared to the other uncertainties; thus, we shall assume 
that Lx and £500 are the same, and repeat our anal- 
ysis using these scaling relations. We find the ampli- 
tudes of a = 0.78 ± 0.09 and 0.69 ± 0.08 (^^ax = 0.2; 
68% CL) for the relations 1 and 2, respectively; thus, 
while these scaling relations give larger amplitudes, they 
cannot completely explain the difference between the re- 
sults of lMelin et al.l ()2010[ ) (a ~ 1) and our results. How- 
ever, we find that the discrepancy is much less for high 
X-ray luminosity clusters. See S ection FTBl 

While the method of Mcli n et al.l (|2010l ) and our 
method are similar, they are different in details. We 
compare the predicted angular radial profiles of the SZ 
effect to the WMAP dat a to find the best-fitting ampli- 
tude. iMelin et al.l (|20100 measured the integrated pres- 
sure within 5 times rsoo, and converted it to the inte- 
grated pressure within rsoo, i^soo, assuming the distri- 
bution o£_2ressurebeyondr5oo is described by the pro- 
file of lArnaud et al.l ()2010() . Whether the difference in 
methodology can account for the difference between our 
results and their results is unclear, and requires further 
investigational 

In any case, we emphasize once again that the SZ ef- 
fect measured by the WMAP and the predictions from 
X-ray data agree well, when the actual X-ray profile of 
individual clusters, rather than the average (or median) 
profile, is used, and there is a reason why Arnaud et al.'s 
profile would overpredict the pressure (i.e., cooling fiows; 
see Section [773]) . There fore, it is like ly that the difference 
between our results and IMelin et al.l |2010,) simply points 
to the fundamental limitation of the analysis using many 



^" There 
IMehn et aU 



differences in the estimators used. In 
I, "matched-filter estimator" proposed by 



[Herranz et al. (20Q2) was used for estimating the normalization of 
the Arnaud ot al. profile. Their estimator is essentially the same 
as the optimal estimator wc derive in Appendix [C] with some dif- 
ferences in details of the implementation. Their estimator is given 
by, in our notation, 



where 



Pi v,{p- 



(82) 



(83) 



Here, t and d are the 2d Fourier transforms of a template map, t, 
and the data map, d, respectively, and P^/j ^1 is the power spec- 
trum of the CMB signal plus instrumental noise, both of which are 
assumed to be diagonal in Fourier space. The summation over the 
repeated indices is understood. For comparison, our estimator for 
the same quantity is given by 



(84) 



where Ctot = A^pix -I- A5harm^"^ is the pixel-space covariance ma- 
trix of the CMB signal plus instrumental noise (see equation IC9I1 . 
and 

F = t^ipi (Cf^^^ ),^ipi _^pt,yp. (85) 
There are two differences in the implementation: 

1. IMelin et all II2010I ) re- project the WMAP data onto 504 
square (10° X 10°) tangential overlapping flat patches, and 
calculate the above 2d Fourier transform on each flat patch. 
We perform the analysis on the full sky by calculating the 
covariance matrix with the spherical harmonics. 

2. IMelin et al.l 112010 ) calculate P from the data. We calculate 
C from the best-fitting ACDM model for the CMB signal 
and the noise model. 



clusters (with little or no X-ray data) and scaling rela- 
tions. 

7.5.5. Results: KS profile and hydrodynamical simulation 

Let us turn our attention to the analytical KS profile. 
For the KS profile with the concentration parameter of 
[Seliak (2000), wc find the best-fitting amplitudes of a = 
0.59 ± 0.09 and 0.46 ± O.OOtg jg (68% CL) for z„ 



0.1 and 0.2, respectively. For the KS p rofile with the 
concentration parameter of iDuffv et al.l ()2008[ ). we find 



= 0.67±0.09 and 0.58 ±0.07 



-1-0.33 

'0.20 



CL) for 



0.1 and 0.2, respectively. These results are consistent 
with those for Arnaud et al.'s pressure profiles. 

Recently, the SPT collaboration detected the SZ power 
spectrum at / > 3000. By fitting th eir SZ power spec- 
trum data to the theoretical model of 'Komats u fc SeliakI 
([2002), they found the best-fitt ing amplit ude of Asz = 
0.37 ± 0. 17 (68% CL; ll uckcr ct^ [20I0I) . The calcu- 
lation of iKomatsu fc Seliak (20021 ) is based on the KS 
gas pressure profile. As the amplitude of SZ power spec- 
trum is proportional to the gas pressure squared, i.e., 
Asz oc a^, our result for the KS profiles, a ss 0.5 — 0.7, is 
consistent with Asz = 0.37 ± 0.17 found from SPT. The 
ACT colla boration placed an upper limit of Asz < 1-63 
(95% CL: iFowler et al.ll2010[) . which is consistent with 
the SPT resuh. 

What do hydrodynamical si mulations tell us? As the 
analytical calculations such as IKomatsu fc SeliaS ()2001l ) 
are limited, we also fit the pre ssure profiles derive d from 
hydrodynamical simulations of lNagai et al.l ()2007[ ) to the 
WMAP data. In the bottom panels of Figure [ITl we 
show the gas pressure profiles from "Non-radiative" and 
"Cooling-|-Star Formation (SF)" runs. 

By fitting the SZ templates constructed from these 
simulated profiles to the WMAP data, we find the 
best-fitting amplitudes of 0.50 ± 0.06 



+0.: 

'0.18 



and 0.67 ± 



CL) for Non-radiative and Cooling+SF 
runs, respectively, which are consistent with the ampli- 
tudes found for the KS profiles and Arnaud et al.'s pro- 
files. See Table [T3l for a summary of the best-fitting am- 
plitudes. 

That the KS, simulation, and Arnaud et al.'s profiles 
yield similar results indicates that all of these profiles 
overpredict the amount of SZ effect seen in the WMAP 
data by '-^ 30 — 50%. This conclusion is made robust 
by the results we presented in Section 17.31 the analysis 
that does not use scaling relations between Lx and rsoo , 
but uses only a subset of clusters that have the detailed 
follow-up observations by Chandra, yields the same re- 
sult. This is one of the main results of our SZ analysis. 

7.6. Luminosity bin analysis 

To see the dependence of the best-fitting normalization 
on X-ray luminosities (hence M500), we divide the cluster 
samples into 3 luminosity bins: (i) "High Lx" with 4.5 < 
Lx/{W^'^ ergs-i) < 45, (ii) "Low Lx" with 0.45 < 
Lx/(10^'* erg s^^) < 4.5, and (in) clusters fainter than 

(ii) . There are 82, 417, and 129 clusters in (i), (ii), and 

(iii) , respectively. In Table [T^l we show that we detect 
significant SZ signals in (i) and (ii), despite the smaller 
number of clusters used in each luminosity bin. We do 
not have a statistically significant detection in (iii). 
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The high Lx clusters have M > 4 x 10" h'^ Mq. 
For these clusters, the agreement between the WMAP 
data and the expected SZ signals is much better. In 
particular, for the REXCESS scaling relation, we find 
a = 0.90 ±0.12, which is consistent with unity within 
the 1(7 statistical error. This implies that, at least for 
high X- ray luminosi t y clus ters, our results and the re- 
sults of iMelin et al.l (|2010[) agree within the statistical 
uncertainty. 

On the other hand, we find that less luminous clus- 
ters tend to have significantly lower best-fitting ampli- 
tudes for all models of gas-pressure profiles and scaling 
relations that we have explored. This trend is consis- 
tent with, for example, the gas mass fraction being lower 
for lower mass clusters. It is also consistent with radio 
point sources filling some of the SZ effect seen in the 
WMAP data. For the point source contamination, see 
Section O 

7.7. Systematic Errors 

The best-fitting amplitudes may be shifted up and 
down by 50% due to the calibration error in the size- 
luminosity relation (equation (j8ip ). As we have shown 
already, the best-fitting amplitudes for the KS profiles 
can be shifted up to .77 an d 0. 91 for the concentra tion 
parameters of Seljil (|2000[ ) and iDuffv eFall ()2008[ ). re- 
spectively. Similarly, the amplitude for Arnaud et al.'s 
profile can be shifted up to 0.97. As this calibration er- 
ror shifts all amplitudes given in Table [T3] by the same 
amount, it does not affect our conclusion that all of the 
gas pressure profiles considered above yield similar re- 
sults. 

This type of systematic error can be reduced by using 
a subset of clusters of galaxies for which the scaling rela- 
tions are more tightly constrained (see, e.g..lPratt et al.l 
[20091: IVikhlinin et all [2009at IMantz et al.li2Q10aD : how^ 
ever, reducing the number of samples increases the statis- 
tical error. Indeed, the analysis presented in Section [7.31 
does not suffer from the ambiguity in the scaling rela- 
tions. 

How important are radio point sources? While we 
have not attempted to correct for potential contamina- 
tion from unresolved radio point sources, we estimate 
the magnitude of effects here. If, on average, each clus- 
ter has a i^src = 10 mJy source, then the corresponding 
temperatures. 



ATsrc = 40.34 /iK 



sinh^(x/2) Fs, 



10- 



sr 



10 mJy fibc 



(86) 



are 2.24, 2.29, and 2.19 in Q, V, and W bands, re- 
spectively. Here, x — i^/(56.78 GHz), and ribeam = 9.0 x 
10~^, 4.2 X 10~^, and 2.1 x 10"^ sr are the solid angles of 
beam s in Q, V, and W bands, respectively ([Jarosik et al.l 
[20101 ). Usi ng the rad i o sou rc es observed in clus t ers o f 
galaxies by [Lin et al.l (|2009[) . [Diego k Partridge! ([2009[ ) 
estimated that the mean flux of sources in Q band is 
10.4 mJy, and that at 90 GHz (which is close to 94 GHz 
of W band) is w 4 to 6 mJy. Using these estimates, we ex- 
pect the source contamination at the level of ss 1 to 2 fj,K 
in V and W bands, which is ss 5 to 10% of the measured 
SZ temperature. Therefore, the best-fitting amplitudes 
reported in Table |T3| could be underestimated by 5 to 
10%. 



7.8. Discussion 

The gas pressure profile is not the only factor that 
determines the SZ power spectrum. The other important 
factor is the mass function, dn/dM: 



Ci cx 



dz—— 

dz 



dM' ' 



2d|2 



(87) 



where V{z) is the comoving volume of the universe and 
P^*^ is the 2d Fourier transform of P'^'^{9). Therefore, a 
lower-than-expected A^z may imply either a lower-than- 
expected amplitude of matter density fiuctuations, i.e., 
fJs, or a lower-than-expected gas pressure, or both. 

As the prediction s for the SZ power spectrum avail- 
able today (see, e.g. . [Shaw et~all[2009l : [Sehgal et al.[[2010L 
an d references there i n) ar e similar to t he prediction 
of [Komatsu fc SeliakI (|2002| ) (for example, iLuek er et al.l 
([20101) foun d Ar7, = 0.55 ± 0.21 for the prediction of 



Sehgal et a l. (2010), which is based on the gas model of 



Bode et al.1 (,2009.) ). a plausible explanation for a lower- 



t han-expected Ap,7, is a lower-than-expected gas pressure. 

[Arnaud et al.l (|2007D find that the X-ray observed inte- 
grated pressure enclosed within rsoo, Yx = Afgas,5oo2x, 
for a given M500 is about a factor of 0.75 ti mes the pre- 
dictio n from the Cooling -1-SF simulation of [Nagai et al.l 
(|20f)l . This is in good agreement with our corre- 
sponding result for the "High Lx" samples, 0.79 ± 0.10 
(68% CL; statistical error only). 

While the KS profile is generally in good agreement 
with Arnaud et al.'s profile, the former is more ex- 
tended than the latter (see Figure [T7[) . which makes 
the KS prediction for the projected SZ profiles bigger. 
Note, ho wever, that the outer slope of the fitti ng formula 
given bv [Arnaud et al.[ (|2010l ) (equation (jD3P ) has been 
fo rced to match tha t from hydrodynamical simulations 
of INagai et all (pOOTi ) in r > rsoQ. See the bottom panels 
of Figure [ITl The steepness of the profile at r > rsoo 
from the simulation may be attributed to a significant 
non-thermal pressure support from pv^, which makes it 
possible to balance gravity by less thermal pressure at 
larger radii. In other words, the total pressure (i.e., ther- 
mal plus pv'^) profile would probably be closer to the KS 
prediction, but the thermal pressure would decline more 
rapidly than the total pressure would. 

If the SZ effect seen in the WMAP data is less than 
theoretically expected, what would be the implications? 
One possibility is that protons and electrons do not share 
the same temperature. The electron-proton equilibration 
time is longer than the Hubble time at the virial radius, 
so that the electron temperature may be lower than the 
proton temperature in the outer regions of clusters which 
contribute a sig nificant fraction of the predi cted SZ flux 
(|R,udd fc Nagail[2009l : [m"ng fc Sarazinl[2009l ). The other 
sources of non-thermal pressure support in outskirts of 
the cluster (turbulence, magnetic field, and cosmic rays) 
would reduce the thermal SZ effect relative to the ex- 
pectation, if these effects are not taken into account in 
modeling the intracluster medium. Heat conduction may 
also play some role in suppressing the gas pressure (Loebl 
12002, 2007). 

In order to explore the impact of gas pressure at r > 
1^500, we cut the pressure profile at Tout = ''500 (instead of 
6^500) and repeat the analysis. We find a = 0.74 ± 0.09 
and 0.44±0.14 for high and low Lx clusters, respectively. 
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(We found a = 0.67±0.09 and 0.43±0.12 for Tout = Orsoo- 
See Table [131) These results are somewhat puzzling - 
the X-ray observations directly measure gas out to r^oo-, 
and thus we would expect to find a w 1 at least out to 
rsoo- This result may suggest that, as we have shown in 
Section 17.31 the problem is not with the outskirts of the 
cluster, but with the inner parts where the cooling flow 
has the largest effect. 

The relative amplitudes between high and low Lx clus- 
ters suggest that a signiflcant amount of pressure is miss- 
ing in low mass {M^qq < 4 x 10^"* h^^ Mq) clusters, even 
if we scale all the results such that high-mass clusters 
are forced to have o = 1. A sim ilar trend is also seen 
in Figure 3 of iMelin et al.l ()2010|) . This interpretation is 
consistent with the SZ power spectrum being lower than 
theoretically expected. The SPT measures the SZ power 
spectrum at I > 3000. At such high multipoles, the con- 
tributions to the SZ power spectrum are dominated by 
relatively low-ma ss clusters, M^on ^ 4 x 10^"^ Mq 
(see Figure 6 of IKomatsu fc Seliak] l200l) . Therefore, 
a plausible explanation for the lower-than-expected SZ 
power spectrum is a missing pressure (relative to theory) 
in lower mass clusters. 

Scaling relations, gas pressure, and entropy of low- 
mass clusters and groups ha v e bee n studied in the 
literature|3 iLeauthaud et al.l ()2010[ ) obtained a rela- 
tion between Lx of 206 X-ray-selected galaxy groups 
and the mass (M200) derived from the stacking anal- 
ysis of weak lensing measurements. Converting their 
best-fitting relation to r2oo-Lx relation, we find r2oo — 
1.26 h-^ Mp- [Lx/{10^^ h-^ ergs-i)]0-22 



(Note that 



the pivot luminosity of the original scaling relation is 
2.6 X lO''^ /i^2 erg s~^) As rsoo ~ 0.65r2oo, their rela- 
tion is ~ la higher than the fiducial scaling relation that 
we adopted (equation (|8l]) ). Had we used their scaling 
relation, we would find even lower normalizations. 

The next generation of simulations or analytical cal- 
culations of the SZ effect should be focused more on un- 
derstanding the gas pressure profiles, both the amplitude 
and the shape, especially in low-mass clusters. New mea- 
surements of the SZ effect toward many individual clus- 
ters with unprecedented sensitivity arc now becoming 
available (iSta niszcwski et al. 2009; Hincks et al. 2003 
iPlagge et al.l 120100 . These new measurements would be 
important for understanding the gas pressure in low-mass 
clusters. 

8. CONCLUSION 

With the WMAP 7-year temperatur e and polariza - 
tion data, new measurements of Hq (Riess et al.ll2009D, 
and i mproved large-scale structure data (jPercival et all 
l2009f l. we have been able to rigorously test the stan- 
dard cosmological model. The model continues to be 
an exquisite fit to the existing data. Depending on 
the parameters, we also use the other data sets such 
as the small-scale CMB temperature power spectra 
(|Brown et al.ll2009l: iReichardt et al.ll2009l for the primor- 
dial helium abunda nce), the power s pectrum of LRGs 
derived from SDSS (|Reid et al.l l 2010aL for neutrino prop- 
erties), the Type la supernova data (jHicken et aLll2009bL 
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Fig. 20. — Two-dimensional joint marginalized constraint (68% 
and 95% CL) on the primordial tilt, ris, and the tensor-to-scalar 
ratio, r, derived from the data combination of WMAP+HAO+Hq. 
The symbols show the predictions from "chaotic" inflation mod- 
els whose potential is given by V(0) oc ifLlnde 1983), with 
« = 4 (solid) and a = 2 (dashed) for single-field models, and 
a = 2 for multi-axion fi eld models with /3 = 1/2 (dotted; 
lEasther fc McAllisterll200g '). 

for dark energy), and t he time-delay dis tance to the 
lens system B1608-f656 (|Suvu et al.| l2010'. for dark en- 
ergy and spatial curvature). The combined data sets 
enable improved constraints over the WMAP-only con- 
straints on the co smological parameters presented in 
iLarson et al.l ()201Cl[ ) on physically-motivated extensions 
of the standard model. 

We summarize the most significant findings from our 
analysis (also see Table [21 [H and [4]): 

1. Gravitational waves and primordial power 
spectrum. Our best estimate of the spectral index 
of a power- law primordial power spectrum of curva- 
ture perturbations is Us = 0.968 ±0.012 (68% CL). 
We find no evidence for tensor modes: the 95% CL 
limit is r < 0.24[3 There is no evidence for 
the running spectral index, dng/dlnk = — 0.022 ± 
0.020 (68% CL). Given that the improvements 
on Hs, r, and dns/dlnk from the 5-year results 
are modest, their implications for models of infia- 
tion are similar to those discussed i n Section 3.3 
of Koma tsu et al.l (l2009aD . Also see IKinnev et al 
(j2008), Peiris fc EastheJ (|2008[) and iFinelh eTal 



A systematic study of the the rmodynamic properties o f low- 
mass cluste rs and groups is given in lFinoguenov et al.l 1)20071 ) (also 
see Finoguenov et al.ll2005ai rhtl 



(2010) for more recent surveys of implications for 
inflation. In Figure I20[ we compare the 7-year 
WMAP+BAO+Ho limits on and r to the pre- 
dictions from inflation models with monomial po- 
tential, !/((/)) cx 

2. Neutrino properties. Better determinations of 
the amplitude of the third acoustic peak of the 
temperature power spectrum and Hq have led to 
improved limits on the total mass of neutrinos, 
^TOi, < 0.58 eV (95% CL), and the effective num- 
ber of neutrino species, N^s = 4.34t°j^ (68% CL), 
both of which are derived from WMAP-|-BAO-|-iJo 
without any information on the growth of struc- 
ture. When BAO is replaced by the LRG power 

This is the 7-year WMAP+BAO+Hq limit. The 5-year 
WMAP-(-BAO-|-SN limit was r < 0.22 (95% CL). For comparison, 
the 7-year WMAP+BAO+SN limit is r < 0.20 (95% CL). These 
limits do not include systematic errors in the supernova data. 
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spectrum, we find J2 ""^v < 0.44 eV (95% CL), and 
the effective number of neutrino species, N^s = 
4..25t°io (68% CL). 

3. Primordial helium abundance. By combining 
the WMAP data with the small-scale CMB data, 
we have detected, by more than 3<7, a change in 
the Silk damping on small angular scales {I > 500) 
due to the effect of primordial helium on the tem- 
perature power spectrum. We find Yp = 0.326 ± 
0.075 (68% CL). The astrophysical measurements 
of helium abundance in stars or HII regions pro- 
vide tight upper limits on Yp, whereas the CMB 
data can be used to provide a lower limit. With 
a conservative hard prior on "5^ < 0.3, we find 
0.23 <Yp< 0.3 (68% CL). Our detection of he- 
lium at z '-^ 1000 contradicts versions of the "cold 
big bang model," where most of the cosmological 
helium is prod uced by the first generation of stars 
(|Aguirrell2000l) . 

4. Parity violation. The 7-year polarization data 
have significantly improved over the 5-year data. 
This has led to a significantly improved limit on 
the rotation angle of the polarization plane due to 
potential parity- violating effects. Our best limit is 
Aa = -1.1° ± 1.4° (statistical) ± 1.5° (systematic) 
(68% CL). 

5. Axion dark matter. The 7-year 
WMAP+BAO+Hq hmit on the non-adiabatic 
perturbations that are uncorrelated with curvature 
perturbations, ao < 0.077 (95% CL), constrains 
the parameter space of axion dark matter in 
the context of the misalignment scenario. It 
continues to suggest that a future detection of 
tensor-to-scalar ratio, r, at the level of r = 10~^ 
would require a fine-tuning of parameters such as 
the misalignment angle, 9 < 3x 10~^, a significant 
amount of entropy production between the QCD 
phase transition and the big bang nucleosynthesis, 
7 < 0.9 X 10~®, a super-Planckian axion decay 
constant, /q > 3 x 10^^ GeV, an axion contribu- 
tion to the matter density of the universe being 
totally sub-dominant, or a combination of all 
of the above with less tuning in each (also see 
Section 3.6.3 of Komatsu et al. 2009j|). The 7-year 
WMAP+BAO+Hq limit on correlated isocur- 
vature perturbations, which is relevant to the 
curvaton dark matter, is a_i < 0.0047 (95% CL). 

6. Dark energy. With WMAP+BAO+Hq but with- 
out high-redshift Type la supernovae, we find 
w = -1.10 ± 0.14 (68% CL) for a flat uni- 
verse. Adding the supernova data reduces the er- 
ror bar by about a half. For a curved universe, 
addition of supernova data reduces the error in 
w dramatically (by a factor of more than four), 
while the error in curvature is well constrained by 
WMAP+BAO+Hq. In Figure [H we show the 7- 
year limits on a time-dependent equation of state 
in the form of w — wq + Wa{l — a). We find 
wq = -0.93 ±0.13 and Wa = -0.4li°-^? (gg^ 
from WMAP+BAO+Hq+SN. The data are consis- 



tent with a flat universe dominated by a cosmolog- 
ical constant. 

7. Primordial non-Gaussianity. The 95% CL 

limits on physically-motivated primordial non- 
Gaussianity parameters are —10 < /j^^*^' < 74, 
-214 < < 266, and -410 < °s < q 

When combined with the limit on from SDSS, 
-29 < < 70 (|Slosar et al.ll^Ol . we find 

—5 < f^^'j^^ < 59. The data are consistent with 
Gaussian primordial curvature perturbations. 

8. Sunyaev— Zel'dovich effect. Using the optimal 
estimator, we have measured the SZ effect to- 
wards clusters of galaxies. We have detected the 
SZ effect toward the Coma cluster at 3.6a, and 
made the statistical detection of the SZ effect by 
optimally stacking the WMAP data at the loca- 
tions of known clusters of galaxies. By stacking 
11 nearby massive clusters, we detect the SZ ef- 
fect at 6.5(7, and find that the measured SZ sig- 
nal and the predictions from the X-ray data agree 
well. On the other hand, we find that the SZ sig- 
nal from the stacking analysis is about 0.5 — 0.7 
times the predictions from the current generation 
of analytical calculations, hydrodynamical simu- 
lations, and t he "u niversal pressure profile" of 
lArnaud et "all (|2010n . We detect the expected SZ 
signal in relaxed clusters that have cool cores. We 
find that the SZ signal from non-relaxed clusters 
have SZ signals that are 50% of the signal pre- 
dicted by Arnaud et al.'s profile. The discrep- 
ancy with theoretical predictions presents a puz- 
zle. This lower-than-theoretically-expected SZ sig- 
nal is consistent with the lower-than-theoretically- 
expected SZ power spec trum recently measured by 
the SPT collaboration (|Lueker et al.ll2010[ ). While 
we find a better agreement between the WMAP 
data and the expectations for massive clusters with 
-^500 ^ 4 X 10^'* Mq, a significant amount 
of pressure (relative to theory) is missing in lower 
mass clusters. Our results imply that we may not 
fully understand the gas pressure in low-mass clus- 
ters. This issue would become particularly impor- 
tant when the SZ effect is used as a cosmological 
probe. 

We also reported a novel analysis of the WMAP tem- 
perature and polarization data that enable us to directly 
"see" the imprint of adiabatic scalar fluctuations in the 
maps of polarization directions around temperature hot 
and cold spots. These give a striking confirmation of our 
understanding of the physics at the decoupling epoch in 
the form of radial and tangential polarization patterns 
at two characteristic angular scales that are important 
for the physics of acoustic oscillation: the compression 
phase at 6* = 29a and the reversal phase at 6 = 9a- 

The CMB data have provided us with many stringent 
constraints on various properties of our universe. One 
of many lessons that we have learned from the CMB 
data is that, given the data that we have, inventions 
of new, physically-motivated, observables beyond the 
spherically-averaged power spectrum often lead to new 
insights into the physics of the universe. Well-studied 
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examples include primordial non-Gaussianity parame- 
ters (/nl from the bispectrum), parity- violation angle 
(Aa from the TB and EB correlations), modulated pri- 
mordial power s pectrum (q(k) from di r ection-dependent 
powe r spec tra; Ackcrman e t a.1.1 12007t IHanson fc LewisI 
120091 iGroeneboom e t al, ,2_01fl see Bennett et al. 2010 
for the 7-year limits). 

The data continue to improve, including more integra- 
tion of the WMAP observations. At the same rate, it 
is important to find more ways to subject the data to 
various properties of the universe that have not been ex- 
plored yet. 
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APPENDIX 

A. EFFECTS OF THE IMPROVED RECOMBINATION HISTORY ON THE ACDM PARAMETERS 

The constraints on the cosmological parameters reported in the original version of this pape r were based on a 
version of CAMB which used a recombina tion history calculated by the RECFAST version 1.4.2 (jSeager et al.l[l99l 
l2000HWong et al.ll2008l : iScott fc Mossll200a) . Shortly after the submission of the original version, a new version CAMB 
was released with the RECFAST version 1.5. This revision incorporates the improved treatment of the hydrogen and 
helium recombination, following numerous work done over the last several years fsee iRubino-Martm et al1l2010l and 
references therein). Speci fically, the code multiplies th e ionization fraction, Xe{z), by a cosmology-independent "fudge 
function," f{z), found bv iRubino-Martm et al.l (|2010[ ). A change in the recombination history mostly affects the time 
and duration of the photon decoupling which, in turn, affe cts the amount of Silk dam ping. Therefore, it is expected 
to affect the cosmological parameters such as Ug and rtbh"^ (jRubino-Martfn et al.ll20ldl ). 

In order to see the effects of the improved recombination code on the cosmological parameters, we have re-run the 
ACDM chain with the latest CAMB code that includes RECFAST version 1.5. We find that the effects are small, and 
in most cases negligible compared to the error bars; however, we find that the significance at which n^, = 1 is excluded 
is no longer more than 3(t: with the improved recombination code, we find rig = 0.968 ± 0.012 (68% CL), and Ug = 1 
is excluded at 99.5% CL. 

Finally, the WMAP likelihood code has also changed from the initial version (4.0), which used a temperature power 
spectrum with a slightly incorrect estimate for the residual point-source amplitude, and a TE power spectrum with 
a slightly incorrect /sky factor. The new version (4.1) corrects both errors; however, the change in the parameters is 
largely driven by the above modification of the recombination history. 

Throughout the main body of this paper, we have adopted the new parameters for the simplest 6-parameter ACDM 
model, but we have kept the previous parameters for all the other models because the cha nges are too small to report. 
We compare the ACDM parameters derived from WMAP+BAO+Hq in Table [Ml See iLarson et all (|2010( 1 for the 
comparison of WMAP-only parameters. 

B. STACKED PROFILES OF TEMPERATURE AND POLARIZATION OF THE CMB 
B.l. Formulae of Stacked Profiles from Peak Theory 

In order to calculate the stacked profiles of temperature and polarization of the CMB at the locations of temperature 
peaks, we need to relate the peak number density contrast, Jpk, to the underlying temperature fluctuation, AT. 

One often encounters a similar problem in the large-scale structure of the universe: how can we relate the number 
density contrast of galaxies to the underlying matter density fluctuation? It is often assumed that the number density 
contrast of peaks with a given peak height v is simply proportional to the underlying density fleld. If one adopted 
such a linear and scale-independent bias prescription, one would fincF^ 

5pk(n) = 6,Ar(n). (Bl) 

For convenience, we write the bias parameters in units of r, * 1-1 

' ^ [temperature] . 
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TABLE 14 

Comparison of the ACDM parameters (WMAP+BAO+Ho): RECFAST version 1.4.2 versus 1.5 



Class Parameter ML (1.5) ML (1.4.2) Moan (1.5) Mean (1.4.2) 



Primary 


lOOSlfah^ 


2.253 


2.246 


2.255 ± 0.054 


2.260 ± 0.053 






0.1122 


0.1120 


0.1126 ± 0.0036 


0.1123 ± 0.0035 






0.728 


0.728 


0.725 ± 0.016 


n 79Q + 0.015 




Us 


0.967 


0.961 


0.968 ± 0.012 


0.963 ± 0.012 




T 


0.085 


0.087 


0.088 ± 0.014 


0.087 ± 0.014 




A^(feo) 


2.42 X 10"^ 


2.45 X 10"^ 


(2.430 ± 0.091) X 10"^ 


(2.44liS;Sf,) X 10-« 


Derived 




0.810 


0.807 


0.816 ± 0.024 


0.809 ± 0.024 




Ho 


70.4 km/s/Mpc 


70.2 km/s/Mpc 


70.2 ± 1.4 km/s/Mpc 


70. 41^ ;^ km/s/Mpc 






0.0455 


0.0455 


0.0458 ± 0.0016 


0.0456 ± 0.0016 




Sic 


0.226 


0.227 


0.229 ± 0.015 


0.227 ± 0.014 






0.1347 


0.1344 


0.1352 ± 0.0036 


0.1349 ± 0.0036 




^rcion 


10.3 


10.5 


10.6 ± 1.2 


10.4 ± 1.2 




to 


13.76 Gyr 


13.78 Gyr 


13.76 ± 0.11 Gyr 


13.75 ± 0.11 Gyr 



However, our numerical simulations show that the linear bias does not give an accurate description of (Qr) or (Tr). In 
fact, breakdown of the lin ear bias is precisely what is expected from the statistics of peaks. From detailed investigations 
of the statistics of peaks, iDesiacqu^ (|2008l ) found the following scale-dependent bias: 

^pk(n) = [b. - 6c (5? + dl)] AT(n). (B2) 

While the first, constant term bi, has been known for a long time (jKaiserill984l : iBardeen et al.lll986( ). the second term 
6^ has been recognized only recently. The presence of fe^ is expected because, to define peaks, one needs to use the 
information on the first and second derivatives of AT. As the first derivative must vanish at the locations of peaks, 
t he above equation does not contain the first derivative. 
iDesiacaiiea (|2008[ ) has derived the explicit forms of b„ and 6^- 

1 iy--fu 1 u-ji^ 
by^—- J, 6c = — J, (B3) 

(To 1 - 7 (72 I- T 

where v = AT/ao, 7 = crJ/(crocr2), ctj is the r.m.s. of jth derivatives of the temperature fluctuation: 

= i E(2^ + + ^)ycriwfr, (b4) 

and is the harmonic transform of a window function (which is a combination of the experimental beam, pixel 
window, and any other additional smoothing applied to the temperature data). The quantity u is called the "mean 
curvature," and is given by -0 = 6*1(7, 7i')/Go(7, 71^), where 



exp 



{x- 



Gnil^x.)^ I dxx-fix)^^^:^. (B5) 
Jo V 27^(1 - 7 ) 

While iDesiacauei (|2008[ ) applied this formalism to a 3-dimensional Ga ussian random field, it is straightforward to 
generalize his results to a 2-dimensional case, for which f{x) is given by (jBond fc EfstathioulllQSTt) . 

fix) = - 1 + exp(-a;2). (B6) 

With the bias given by equation (|B2[) . we find 

(Jpk(n)Q.(n + 0))= j ^W^Wfib^ + b^P) 

X {Q™cos[2(<?!)-(p)] -|-Q™sin[2(0-(^)]}e''-^, (B7) 

(2^)2 

X {C,™ sin[2((/) - if )] - C;™ cos[2(0 - e'''^, (B8) 

where Wf and Wf are spherical harmonic transforms of the smoothing functions applied to the temperature and 
polarization data, respectively. Recalling \ - 6 = 19 cosicj) — if), J^^ d(^sin[2(0 — tp)]e''^'^°^i't>-'P) — q, and 

.1^ 277 



(Jpk(n)C/,(n + 6))= - I Wfw[{b, + bcl") 
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with m = 2, '4> = ip— (f) — tt/2 and a = —<j) — 7r/2, we find 

(^pk(n)Q.(n + 0)) 

= -/ ^■^Wrwf{K + b,f)CrJ2{l9l (BIO) 
(5pk(n)C/,(n + 0)) 

= -/ ^-^w^wf{K + b,i')cruie)- (Bii) 

Using these results in equations © and ([TU| . we finaUy obtain the desired formulae for the stacked polarization 
profiles: 

{Qrm = - j ^■^w^wf{h^ + b^P)cruie). (B12) 

{Urm^- j ^^wrwr{K+h,p)crj2m. (bis) 

Incidentally, the stacked profile of the temperature fluctuation can also be calculated in a similar way: 

{Tm = J ^^{wn\K+ki')crJom. (bm) 

B.2. A Cookbook for Computing {Qr){d) and {Ur){0) 
How can we evaluate equations (|B12p - (|B14p ? One may follow the following steps: 

1. Compute (To, cti, and (T2 from equation (IB4p . For example, the WMAP 5-year best-fitting temperature power 
spectrum for a power-law ACDM model (|Dunkley et al.l [2009rF^ . multiplied by a Gaussian smoothing of 0.5° 
fuU-width-at-half-maximum (FWHM) and the pixel window function for the HEALPix resolution of TVsidc = 512, 
gives ao = 87.9 fjK, ai = 1.16 x 10^ fiK, and (72 = 2.89 x 10^ ^K. 

2. Compute 7 — cri/{(jQa2). For the above example, we find 7 = 0.5306. 

3. As we need to integrate over peak heights we need to compute the functions Go (7; 7*^) a-nd 6*1(7, 7^^) for various 
values of v. The former function, 6*0(7, 7^)i ca n be found analytically (see equation (A1.9) of lBond fc Efstathioul 
[l987i' ) . For Gi, we need to integrate equation (jB5[) numerically. 

4. Compute u = Gi/Gq. For the above example, we find u = 1.596, 1.831, 3.206, and 5.579 for v — 0^1, 5, and 10, 
respectively. 

5. Choose a thr eshold peak height Vu and compute the mean surface number density of peaks, npk, from equa- 
tion (A1.9) of IBond fc Efstathioul (Il987h : 

"■^^("^^ = (2.r)3/2(2a^) / ^"'''^'Go(7,7-). (B15) 

The integration boundary is taken from vt to +cx) for temperature hot spots, and from —00 to —\i't\ for temper- 
ature cold spots. For the above example, we find Airn-py — 15354.5, 8741.5, 2348.9, and 247.5 for vt = 0, 1, 2, 
and 3, respectively. 

6. Compute bi, and b^ from equation (jB3|) for various values of v. 

7. Average b^, and 5^ over v. We calculate the averaged bias parameters, bi, and 6^, by integrating bi, and 6^ 
multiplied by the number density of peaks for a given ly. We then divide the integral by the mean number 
density of peaks, npk, to find 

'^•^^^^^Tr^mT^ / e-'^'^'Go(7,7i^)&., (B16) 
npk(i^t) (27r)-^/^(2crf) J 



1 a2 



2 



npk(i^t) (2^)3/2(2^2) 



diy e-''''^^Goij,-fv)bc. (B17) 



The integration boundary is taken from i^t to +00 for temperature hot spots, and from —00 to — li^t] for tem- 
perature cold spots. For the above example, we find (6,y,fc^) = (3.289 x 10^^^ 6.039 x 10^''), (1.018 x 10^2^ 
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We used the 5-year best-fitting power spectrum to calculate u4--j\ j i-tii^T l i- j-t 

, , , ■ J, /I r ^r j wcre obtamed) and compare it to the 7- year polarization data, 

the predicted polarization pattern (berore the 7- year parameter / r 
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5.393 X 10-^), (2.006 x 10"^ 4.569 x lO"^), and (3.128 x lO^^, 3.772 x IQ-^) for = 0, 1, 2, and 3, respectively 
(all in units of /iK^^). The larger the peak height is, the larger the linear bias and the smaller the scale-dependent 
bias becomes. 



8. Use and in equations (|B12[) and (|B13p to compute {Qr}{d) and {Ur){9) for a given set of C™ and C'^ 
respectively. 



Very roughly speaking, the bias takes on the following values: 



0.3 



100 

3 

100 fj.K 



1 



' I -\ 



(for lyt = 0) 
(for i^t ~ 3) 



(B18) 



The scale dependence of bias becomes important at ^ ~ 75 for = 0, but the higher peaks are closer to having a linear 
bias on large scales. One may also rewrite this using the stacked temperature values at the center, (T)(0) = (107.0, 
151.4, 216.4, 292.1) for vt = 0, 1, 2, and 3: 



(0.35,1.5,4.3,9.1) 
(107, 151,216,292)^K 



I 



(74,137,219, 



(B19) 



C. OPTIMAL ESTIMATOR FOR SZ STACKING 
C.l. Optimal estimator 

In this Appendix, we describe an optimal likelihood-based method for estimating the stacked SZ profile around 
clusters whose locations are taken from external catalogs. 

Formally, we can set up the problem as follows. We represent the WMAP data as a vector of length (A'^chan x iVpix) 
and denote it by d^^p, where the index v = 1, ■ ■ ■ , A'chan ranges over channels, and the index p = 1, • ■ ■ , Apix ranges 
over sky pixels. (We typically take A^chan = 6 corresponding to VI, V2, Wl, W2, W3, and W4; and A^pix = 12{2^"f 
corresponding to a HEALPix resolution of A'sidc = 1024.) We model the WMAP data as a sum of CMB, noise, and 
SZ contributions as follows: 

Wtmpl 



tm 



+ Pa{ta) 
1 



(CI) 



In this equation, we have written the SZ contribution as a sum of A'tmpi template maps, ii, • • • , ^ATtmpu whose coefficients 
Pa are free parameters to be determined. The operator, A^p^im, in Eq. (IC1[) converts a harmonic-space CMB realization, 
airm into a set of maps with black-body frequency dependence and channel- dependent beam convolution. More 
formally, the matrix element A^p^im is defined by 



Aup,em = b^lYirnip), 



(C2) 



where h^^ is the beam transfer function (including HEALPix window function) for channel v. 

The specific form of the template maps, ^q, will depend on the type of profile reconstruction which is desired. For 
example, if we want to estimate a stacked amplitude for the SZ signal in A^ angular bins, we define one template for 
each bin. If the bin corresponds to angular range ^min ^ & ^ ^max, we define a map nip which is =1 if the angular 



distance 9 between pixel p and some galaxy cluster in the catalog is in the range 9^ 



<9< 



and zero otherwise. 



We convolve this map with the beam in each channel v and multiply by the SZ frequency dependence to obtain the 
template t^p. As another example, if we want to fit for an overall multiple of a fiducial model nip for the total SZ 
signal (summed over all clusters) then we define a single (i.e. Aftmpi = 1) template t^p by applying beam convolution 
and the SZ frequency dependence for each channel v. 

Given this setup, we would like to compute the likelihood function L[pct|c?,yp] for the profile Pa, given the noisy data 
d^p, marginalizing over the CMB realization. We assume a fixed fiducial model Ce and represent the CMB signal 
covariance by a (diagonal) matrix 5'harm in harmonic space: 

{Sharm) em,e'm' — CiSe£'S„im' ■ 

We represent the noise covariance by an (also diagonal) pixel-matrix Nypypi: 

The joint (CMB,SZ) likelihood function can now be written (up to an overall normalizing constant): 



(C3) 



(C4) 



L[a,p|(i] cx exp 



-{d-Aa- patafN~\d -Aa- p^ta) 



(C5) 
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(In this equation we have omitted some indexes for notational compactness, e.g. d^^p — > d and aim — ^ a- The summation 
over a is assumed.) We can now integrate out the CMB reahzation a to obtain the marginahzed hkehhood for the 
profile: 

L[p|d] = j Dah[a,p\d\ (C6) 
I 



(X exp 



(C7) 



where we have defined the (A^tmpi)-t)y-(A^tmpi) matrix 

[iVpi. + A^harm^' Vp'^r^pit^U, (C8) 

and the length-(A^tinpi) vector 

Pa - F-^{t/3h'p'Wpi^ + ^^harm^^];v,,prf.p. (C9) 

The hkehhood function L[p|c?] in Eq. (|C7p has a simple interpretation. The likelihood for pa is a Gaussian with mean 
Pa and covariance matrix F~p. This is the main result of this section, and is the basis for all our SZ results in the 
body of the paper. For example, when we reconstruct the stacked SZ profile in angular bins, the estimated profile in 
each bin a is given by pa and the la error is given by {F~^)aa- 

It is worth mentioning that the statistic Pa also app ears naturally if we use an estimator framework rather than a 
likelihood formalism. If we think oi pa, defined by Eq. (|C9|) . as an estimator for the profile given the data d, then one 
can verify that the estimator is unbiased (i.e. (pa) — Pa, where the expectation value is taken over random CMB + 
noise realizations with a fixed SZ contribution) and its covariance is F^^^ . Conversely, it is not hard to show that pa 
is the unbiased estimator with minimum variance, thus obtaining pa in a different way. This alternate derivation also 
shows that the error bars on the profile obtained in our likelihood formalism are the same as would be obtained in a 
direct Monte Carlo treatment. 

Either from the likelihood or estimator formalism, one sees that the statistic pa is optimal. In the limit where 
all frequency channels are in the Rayleigh- Jeans regime, the statistic pa corresponds to -filtering the data and 
multiplying by each template map. In this case, the filter acts as a high-pass filter which optimally suppresses 
CMB power on scales larger than the clusters, and also optimally weights the channels (in a way which is ^-dependent if 
the beams differ). When channels with higher frequency are included, the statistic pa would get most of its information 
from linear combinations of channels which contain zero CMB signal, but nonzero response to an SZ signal. (Such a 
combination of channels does not need to be high-pass filtered, increasing its statistical weight.) 

For the V-^W combination in WMAP, the N'^ -filtered (V- W) null map is used to separate the SZ effect and CMB, 
as CMB is canceled in this map while the SZ is effect not. 

We conclude with a few comments on implementation. Inspection of Eqs. (jC8|) and (|C9p shows that it would be 
straightforward to compute Fap and Pa , given an algorithm for multiplying a set of iVchan pixel space maps d^p by the 
operat or [A^pix + ASharm^"^]^^- A fast multigrid-based algorith m for this inv e rse pr oblem was found in iSmith et al.l 
(I2007D but there is one small wrinkle in the implementation: in ISmith et al.l (120071 ) the problem was formulated in 
harmonic space and an algorithm was given for multiplying by the operator [S^^j.^-\- -AJ" N~-^^A\^^ . However, the matrix 
identity 

[A^pix + A5ha™A^]-^ = N^,l A[5,7,i,„ + A^N^.^AY'A', (CIO) 

allows us to relate the two inverse problems. In fact there is another advantage to using the expression on the RHS 
of Eq. (|C10|) : because the inverse noise, N~^, appears instead of the noise covariance A'pix, a galactic mask can be 
straightforwardly included in the analysis by zeroing the matrix entries of N^^^^ which correspond to masked pixels, so 
that the pixels are treated as infinitely noisy. 

D. PRESSURE PROFILES 

D.l. Pressure profile from X-ray observations 

Recently, lArnaud et al.l (|2010( ) found that the following parametrized pheno menological electro n pressure profile, 
which is based on a "generalized Navarro-Frenk- White profile" proposed by iNagai et al.l (I2007D. fits t he electron 
pressure profiles directly derived from X-ray data of clusters well (see equation (13) of lArnaud eraLllMoh : 



Pe{r)^im {h/0.7f eV cm 



-3 



£;8/3(z) 



M; 



500 



2/3+Qp 



p(r-A5oo), (Dl) 



3 X 1014(0. 7//i)Mq_ 

1 /2 

where ap = 0.12, E{z) = H{z)/Ho = [rJ™(l -I- zf + ' for a ACDM model, rsoo is the radius within which the 
mean overdensity is 500 times the critical density of the universe, Pc{z) = 2.775 x 1{)^^E'^{z) Mq Mpc"'^, and M500 



WMAP 7-year Cosmological Interpretation 



49 



is the mass enclosed within r^oo- 



The function p{x) is defined by 



500 



47r 

y[500p,(z)]r. 



3 

500- 



p{x) 



8.403(0.7//i)3/2 



(C500X)^[1 + (C500X)"](^-'')/" 



(D2) 



(D3) 



where cgoo = 1.177, a = 1.051, /3 = 5.4905, and 7 = 0.3081. 

The SPT collaboration stacked the SZ maps of 11 known cl usters and fitted t he stacked SZ radial profile to the 
above form, finding C500 = 1.0, a = 1.0, ^ = 5.5, and 7 = 0.5 (|Plagge et al.l[2010l ). While they did not compare the 
overall amplitude (which is the focus of our analysis) , the shape of the press ure profile fou nd by the SPT collaboration 
(using the SZ data) is in an excellent agreement with that found by Arnaud ~et al.l ()2010[) (using the X-ray data). 

D.2. Pressure profile from hydrostatic equilibrium 

The KS profile builds on and extends the idea originally put forward bv lMakino et al.l ()1998D and lSuto et al.l ()1998D : 
(i) gas is in hydro static equilibrium with gravitational potential given by a Navarro-Frenk- White (NFW) dark matter 
density profile ( Navarro et al.|[T997[ ) and (ii) the equation of state of gas is given by a polytropic form, P (x p'^ . However, 
this m odel still contains two free parameters: a polytropic index 7 and the normalization of P. Komats u fc SeliaU 
(|2001f ) found that an additional, physically reasonable assumption that (iii) the slope of the gas density profile and that 
of the dark matter density profile agree at around the virial radius, uniquely fixes 7, leaving only one free parameter: 
the normalization of P. These assumptions are supported by hydrodynamical simulations of clusters of galaxies, and 
the resulting shape of the KS profile indeed agrees with simulations reasonably well (see, however. Section 17.81 for a 
discussion on the shape of the profile in the outer parts of clusters). 

Determining the normalization of the KS profile requires an additional assumption, described below. Also note that 
this model does not take into account an y non-thermal pre s sure (such as pv^ where v is the bulk or turbulent velocity), 
gas cooling, or star formation (see, e.g., iBode et al.ll2009t iFrederiksen et all 120091 and references therein for various 
attempts to incorporate more gas physics). 

The KS gas pressure profile is given by (see Section 3.3 of iKomatsu &: Seliakll2002l for more detailed descriptions) 

(D4) 

for X = 0.76. Here, r. 



Pgas(r)=Pgas(0)[ygas(r/r,)]^. 

The electron pressure profile, Pe, is then given by Pe = [(2 -I- 2X)/{3 + 5X)]Pgas — 0.518Pgas 
the so-called "scale radius" of the NFW profile, and a function ygas{x) is defined by 



IS 



Vgasix) = <1-B 



1 - 



ln(l + x) 



1/(7-1) 



with 



B = 3t]'\0) 



7-1 



ln(l 



1 + c 



(D5) 

(D6) 
(D7) 
(D8) 

Here, c is the so-called "concentration parameter" of the NFW profile, which is related to the scale radius, Tg, via 
c = r^ii/rs, and r^iv is the virial radius. The virial radius gives the virial mass, Mvir, as 



and 



7 = 1.137 + 8.94 X IQ-^ ln(c/5) - 3.68 x 10-^{c ~ 5), 



?7(0) = 2.235 -I- 0.202(c - 5) - 1.16 x 10-^{c - 5)^ 



M,i, = —[A,{z)p,{z)]rl,^. 



Here, Ac{z) depends on and ^^a as (jBrvan fc Normanlll998f) 

Ac(z) = ISn^ + 82[n{z) - 1] - 39[f7(z) - if, 



(D9) 



(DIO) 



where n{z) = 0,^(1 + z)^ / E'^ {z) (also see lLacev fc Colelll993tlNakamura fc Sutolll997L for other fitting formulae). For 
= 0.277, one finds Ac(0) ~ 98. 
The central gas pressure, Pgas(O), is given by 



-Pgas(O) = 55.0 eV cm" 



Pgas(O) 



1014 Mq Mpc" 



fcBTgas(O) 



8 keV 



where ks is the the Boltzmann constant. The central gas temperature, Tgas(O), is given by 

"Mvir/(10l5 ft-l Mq)" 



kBTgW^) = 8.80 keV 77(0) 



rvir/(l h-^ Mpc) 



(Dll) 



(D12) 
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The central gas density, pgas(O), will be determined such that the gas density at the virial radius is the cosmic mean 
baryon fraction, ilb/rim, times the dark matter density at the same radius. This is an assumption. In fact, the cosmic 
mean merely provides an upper limit on the baryon fraction of clusters, and thus we expect the gas pressure to be less 
than what is given here. How much less needs to be determined from observations (or possibly from more detailed 
modeling of the intraclustcr medium). In any case, with this assumption, we find 

/5gas(0) = 7.96 X 10" Mq Mpc"^ 

\ Afvir/(10i5 h-^ Mq) 

1 1-1 

(D13) 



n„J [rvir/(l h-^ Mpc)]3 



„2 



1 



ln(l + c) 



1 + c 



(1 + C)2 ygas(c) 

This equation fixes a typo in equation (21) of IKomatsu fc SeliakI (j2002D . 

The virial radius, r^ir, is approximately given by 2r5oo; thus, Mvir is approximately given by 8Ac/500 ~ 1.6. However, 
the exact relation depends on the mass (see, e.g.. Figure 1 of iKo matsu & Scliak 2001). We calculate the mass within 
a given radius, r, by integrating the NFW density profile ([Navarro et al.lll997ll : 

PNFw(f) = . , w,^", — i-T^- (^1'*) 
(r/rs)(l + r/rsY 

Specifically, for a given M500 and raoo, we solve the following non- linear equation for Mvir: 

TO(cr50o/rvir) 

-Mvir ■r-\ ^ M5Q0, (D15) 

m(c) 

where m(a:) = ln(l + x) — x/{l + x). Here , rvir is related to Myjr vi a equation (|D9p . We also need a relation between 
the concentration parameter, c, and Mvir- IKomatsu fc Selial^ ()2002[ ) used 

10 / M, 

Cscljak 



l + z V3.42 X 1012 ft-i Mq 
5.09 ' " " " ""-^ 



(D16) 



l + z VlO" h-^ Mq^ 

which was adopted from 'Scliak' (200^. 

Recently, iDuffv et al.l ()2008.) ran larg e iV-body simulations with the WMAP 5-year cosmological parameters to find 
a more accurate fitting formula for the concentration parameter: 



7.85 / Afv 

Cduffy = 



(l + z)0-7i ^2 X 1012 /i-i Mq 

5.72 / Afvir ^ "" "^^ 



(l + z)0-7i VlO" /i-i Mq^ 
This formula makes clusters of galaxies (M© > lOi"! Mq) more concentrated than Csoijak would predict. 



(D17) 
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